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PREFACE 


A Note to Students, Teachers, and other Readers 


Thank you for reading this short preface. Allow me to share a few key points about 
the text so that you may better understand what you will find beyond this page. 

This text deals with matrix algebra, as opposed to /inear algebra. Without argu- 
ing semantics, | view matrix algebra as a subset of linear algebra, focused primarily 
on basic concepts and solution techniques. There is little formal development of the- 
ory and abstract concepts are avoided. This is akin to the master carpenter teaching 
his apprentice how to use a hammer, saw and plane before teaching how to make a 
cabinet. 

This book is intended to be read. Each section starts with “AS YOU READ” questions 
that the reader should be able to answer after a careful reading of the section even if 
all the concepts of the section are not fully understood. | use these questions as a daily 
reading quiz for my students. The text is written in a conversational manner, hopefully 
resulting in a text that is easy (and even enjoyable) to read. 

Many examples are given to illustrate concepts. When a concept is first learned, 
| try to demonstrate all the necessary steps so mastery can be obtained. Later, when 
this concept is now a tool to study another idea, certain steps are glossed over to focus 
on the new material at hand. | would suggest that technology be employed in a similar 
fashion. 

This text is “open.” If it nearly suits your needs as an instructor, but falls short in 
any way, feel free to make changes. | will readily share the source files (and help you 
understand them) and you can do with them as you wish. | would find such a process 
very rewarding on my own end, and | would enjoy seeing this text become better and 
even eventually grow into a separate linear algebra text. | do ask that the Creative 
Commons copyright be honored, in that any changes acknowledge this as a source 
and that it only be used non commercially. 

This is the third edition of the Fundamentals of Matrix Algebra text. | had not 
intended a third edition, but it proved necessary given the number of errors found in 
the second edition and the other opportunities found to improve the text. It varies 
from the first and second editions in mostly minor ways. | hope this edition is “stable;” 
| do not want a fourth edition anytime soon. 

Finally, | welcome any and all feedback. Please contact me with suggestions, cor- 
rections, etc. 

Sincerely, 

Gregory Hartman 
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SYSTEMS OF LINEAR EQUATIONS 


You have probably encountered systems of linear equations before; you can proba- 
bly remember solving systems of equations where you had three equations, three 
unknowns, and you tried to find the value of the unknowns. In this chapter we will 
uncover some of the fundamental principles guiding the solution to such problems. 

Solving such systems was a bit time consuming, but not terribly difficult. So why 
bother? We bother because linear equations have many, many, many applications, 
from business to engineering to computer graphics to understanding more mathemat- 
ics. And not only are there many applications of systems of linear equations, on most 
occasions where these systems arise we are using far more than three variables. (En- 
gineering applications, for instance, often require thousands of variables.) So getting 
a good understanding of how to solve these systems effectively is important. 


But don’t worry; we’ll start at the beginning. 


1.1 Introduction to Linear Equations 


1. What is one of the annoying habits of mathematicians? 
2. What is the difference between constants and coefficients? 


3. Cana coefficient in a linear equation be 0? 
We’ll begin this section by examining a problem you probably already know how 
to solve. 


Example 1 Suppose a jar contains red, blue and green marbles. You are told 
that there are a total of 30 marbles in the jar; there are twice as many red marbles as 
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green ones; the number of blue marbles is the same as the sum of the red and green 
marbles. How many marbles of each color are there? 


SOLUTION We could attempt to solve this with some trial and error, and we’d 
probably get the correct answer without too much work. However, this won’t lend 
itself towards learning a good technique for solving larger problems, so let’s be more 
mathematical about it. 

Let’s let rrepresent the number of red marbles, and let b and g denote the number 
of blue and green marbles, respectively. We can use the given statements about the 
marbles in the jar to create some equations. 

Since we know there are 30 marbles in the jar, we know that 


r+b+g=30. (1.1) 


Also, we are told that there are twice as many red marbles as green ones, so we know 
that 
r= 2g. (1.2) 


Finally, we know that the number of blue marbles is the same as the sum of the red 
and green marbles, so we have 
b=r-+g. (1.3) 


From this stage, there isn’t one “right” way of proceeding. Rather, there are many 
ways to use this information to find the solution. One way is to combine ideas from 
equations 1.2 and 1.3; in 1.3 replace r with 2g. This gives us 


b=2g+ 9 = 3g. (1.4) 


We can then combine equations 1.1, 1.2 and 1.4 by replacing rin 1.1 with 2g as we did 
before, and replacing b with 3g to get 


r+b+g=30 
29+ 39+ g9 = 30 
6g = 30 

g=5 (1.5) 


We can now use equation 1.5 to find r and b; we know from 1.2 that r = 2g = 10 
and then since r+ b+ g = 30, we easily find that b = 15. 


Mathematicians often see solutions to given problems and then ask “What if.. .?” 
It’s an annoying habit that we would do well to develop — we should learn to think 
like a mathematician. What are the right kinds of “what if” questions to ask? Here’s 
another annoying habit of mathematicians: they often ask “wrong” questions. That 
is, they often ask questions and find that the answer isn’t particularly interesting. But 
asking enough questions often leads to some good “right” questions. So don’t be afraid 
of doing something “wrong;” we mathematicians do it all the time. 

So what is a good question to ask after seeing Example 1? Here are two possible 
questions: 
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1. Did we really have to call the red balls “r’? Could we call them “q’”? 
2. What if we had 60 balls at the start instead of 30? 


Let’s look at the first question. Would the solution to our problem change if we 
called the red balls g? Of course not. At the end, we’d find that q = 10, and we would 
know that this meant that we had 10 red balls. 

Now let’s look at the second question. Suppose we had 60 balls, but the other 
relationships stayed the same. How would the situation and solution change? Let’s 
compare the “orginal” equations to the “new” equations. 


Original New 
r+b+g=30/r+b+g=60 
r=2g r=2g 
b=r+g b=r+g 


By examining these equations, we see that nothing has changed except the first 
equation. It isn’t too much of a stretch of the imagination to see that we would solve 
this new problem exactly the same way that we solved the original one, except that 
we’d have twice as many of each type of ball. 

Aconclusion from answering these two questions is this: it doesn’t matter what we 
call our variables, and while changing constants in the equations changes the solution, 
they don’t really change the method of how we solve these equations. 

In fact, it is a great discovery to realize that all we care about are the constants and 
the coefficients of the equations. By systematically handling these, we can solve any 
set of linear equations in a very nice way. Before we go on, we must first define what 
a linear equation is. 


Definition 1 Linear Equation 


A linear equation is an equation that can be written in the 
form 
4X1 + 2X2 +++ + AnXp = C 


where the x; are variables (the unknowns), the a; are 
coefficients, and c is a constant. 


A system of linear equations is a set of linear equations that 
involve the same variables. 


A solution to a system of linear equations is a set of values 
for the variables x; such that each equation in the system is 
satisfied. 


So in Example 1, when we answered “how many marbles of each color are there?,” 
we were also answering “find a solution to a certain system of linear equations.” 
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The following are examples of linear equations: 


2x + 3y — 7z= 29 
7 
t+ 7X2 + x3 —X4 + 17x, = V¥—10 
y, + 14y,+4=yo+13-y, 


3t 
Vir + 1s + ere cos(45°) 


Notice that the coefficients and constants can be fractions and irrational numbers 
(like 7, y/—10 and cos(45°)). The variables only come in the form of a;x;; that is, just 
one variable multiplied by a coefficient. (Note that 3 = 2t, just a variable multiplied 
by a coefficient.) Also, it doesn’t really matter what side of the equation we put the 
variables and the constants, although most of the time we write them with the vari- 
ables on the left and the constants on the right. 

We would not regard the above collection of equations to constitute a system of 
equations, since each equation uses differently named variables. An example of a sys- 


tem of linear equations is 


Xy —X2 +X34+xX%, = 1 
2x, + 3X2 +Xq, = 25 


X2 + X3 = 10 


It is important to notice that not all equations used all of the variables (it is more 
accurate to say that the coefficients can be 0, so the last equation could have been 
written as Ox, + x. + x3 + 0x, = 10). Also, just because we have four unknowns does 
not mean we have to have four equations. We could have had fewer, even just one, 
and we could have had more. 

To get a better feel for what a linear equation is, we point out some examples of 
what are not linear equations. 


2xay+z=1 
5x* + 2y° = 100 


1 
= +J/y+24z =3 
x 


sin? + cos? X2 = 29 
2X1 + In x2 = 13 


The first example is not a linear equation since the variables x and y are multi- 
plied together. The second is not a linear equation because the variables are raised 
to powers other than 1; that is also a problem in the third equation (remember that 
1/x = x1 and \/x = x‘/2). Our variables cannot be the argument of function like sin, 
cos or In, nor can our variables be raised as an exponent. 


1.2 Using Matrices To Solve Systems of Linear Equations 


At this stage, we have yet to discuss how to efficiently find a solution to a system 
of linear equations. That is a goal for the upcoming sections. Right now we focus on 
identifying linear equations. It is also useful to “limber” up by solving a few systems of 
equations using any method we have at hand to refresh our memory about the basic 
process. 


Exercises 1.1 


15. A farmer looks out his window at his 
-Xa+ty+t=1 chickens and pigs. He tells his daugh- 
14.9 = 3cos(y) — 5z ter that he sees 62 heads and 190 legs. 


a How many chickens and pigs does the 


farmer have? 


In Exercises 1 — 10, state whether or not the ym X - wy = 3 
given equation is linear. 3x + by = 8 
1. xt+y+z=10 x y z = 1 
2. xy+yz+xz=1 13. 2x + 6y -—- z = -4 
4x Sy wy =O 
3 3x+9=3y—5z+x-7 
4. /S5ytomx=-l x + y - z= 1 
14. 2x + y = 4 
5. (x—1)(x+1) =0 a BS & 
6. xe tx3 = 25 
7 
8 
9 


. cos(15)y+74=-1 


10. 2°+ 2’ = 16 
16. A lady buys 20 trinkets at a yard sale. 


In Exercises 11 — 14, solve the system of linear The cost of each trinket is either $0.30 

equations. or $0.65. If she spends $8.80, how 

ja. @ fy = si many of each type of trinket does she 
“2x -— By = 8 buy? 


1.2 Using Matrices To Solve Systems of Linear Equations 


AS YOU READ 


1. What is remarkable about the definition of a matrix? 
2. Vertical lines of numbers in a matrix are called what? 
3. Ina matrix A, the entry as3 refers to which entry? 


4. What is an augmented matrix? 


In Section 1.1 we solved a linear system using familiar techniques. Later, we com- 
mented that in the linear equations we formed, the most important information was 
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the coefficients and the constants; the names of the variables really didn’t matter. In 
Example 1 we had the following three equations: 


r+b+g=30 
r=2g 
b=r+qQg 


Let’s rewrite these equations so that all variables are on the left of the equal sign 
and all constants are on the right. Also, for a bit more consistency, let’s list the variables 
in alphabetical order in each equation. Therefore we can write the equations as 


bo+ g + r = 30 
-—- 29 + r= 0. (1.6) 
—-b + g + r= 0 


As we mentioned before, there isn’t just one “right” way of finding the solution to 
this system of equations. Here is another way to do it, a way that is a bit different from 
our method in Section 1.1. 

First, lets add the first and last equations together, and write the result as a new 
third equation. This gives us: 


b+ g + r = 30 
—- 29 + r= 0. 
29 + 2r = 30 


A nice feature of this is that the only equation with a b in it is the first equation. 
Now let’s multiply the second equation by —}. This gives 


b+ g + r = 30 
g -— 1/27 = 0. 
29 + #=2r = 30 


Let’s now do two steps in a row; our goal is to get rid of the g’s in the first and third 
equations. In order to remove the g in the first equation, let’s multiply the second 
equation by —1 and add that to the first equation, replacing the first equation with 
that sum. To remove the g in the third equation, let’s multiply the second equation by 
—2 and add that to the third equation, replacing the third equation. Our new system 
of equations now becomes 


b + 3/2r = 30 
g — 1/2r = 0. 
3r = 30 


Clearly we can multiply the third equation by ; and find that r = 10; let’s make 
this our new third equation, giving 


b + 3/2r = 30 
g — 1/2r = 0. 
r = 10 
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Now let’s get rid of the r’s in the first and second equation. To remove the r in the 
first equation, let’s multiply the third equation by -3 and add the result to the first 
equation, replacing the first equation with that sum. To remove the r in the second 
equation, we can multiply the third equation by $ and add that to the second equation, 
replacing the second equation with that sum. This gives us: 


b = 15 
g = 5 . 
r = 10 
Clearly we have discovered the same result as when we solved this problem in Section 
1.1. 

Now again revisit the idea that all that really matters are the coefficients and the 
constants. There is nothing special about the letters b, g and r; we could have used x, 
y and z or x1, X2 and x3. And even then, since we wrote our equations so carefully, we 
really didn’t need to write the variable names at all as long as we put things “in the 
right place.” 

Let’s look again at our system of equations in (1.6) and write the coefficients and 
the constants in a rectangular array. This time we won’t ignore the zeros, but rather 
write them out. 


bo + g + r = 30 Hi 1 1 30 
—- 29 + r= 0 4 Oo -2 1 O 
= ee g + fr = 0 —1 1 1 0 


Notice how even the equal signs are gone; we don’t need them, for we know that the 
last column contains the coefficients. 

We have just created a matrix. The definition of matrix is remarkable only in how 
unremarkable it seems. 


Definition 2 Matrix 


A matrix is a rectangular array of numbers. 


The horizontal lines of numbers form rows and the vertical 
lines of numbers form columns. A matrix with m rows and 
ncolumns is said to be an m x n matrix (“an m by n matrix”). 


The entries of an m x n matrix are indexed as follows: 


O11 a13,- ++" G1n 
21 op Chip 
31 sts Q3n 


Om1 Om3 ins Omn 


That is, a2 means “the number in the third row and second 
column.” 
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In the future, we’ll want to create matrices with just the coefficients of a system 
of linear equations and leave out the constants. Therefore, when we include the con- 
stants, we often refer to the resulting matrix as an augmented matrix. 

We can use augmented matrices to find solutions to linear equations by using es- 
sentially the same steps we used above. Every time we used the word “equation” 
above, substitute the word “row,” as we show below. The comments explain how we 
get from the current set of equations (or matrix) to the one on the next line. 

We can use a shorthand to describe matrix operations; let R1, Rp represent “row 1” 
and “row 2,” respectively. We can write “add row 1 to row 3, and replace row 3 with 
that sum” as “R; + R3 — R3.” The expression “R; <> R2” means “interchange row 1 


and row 2.” 
bo + g + r = 30 1 1 1 30 
—- 2g + r= 0 Oo -2 1 0 
b g r = 0 —-1 1 1 0 


Replace row 3 with the sum of 
rows 1 and 3. 
(Ri + R3—-> R3) 


Replace equation 3 with the sum 
of equations 1 and 3 


b+ g + r = 30 1 1 1 30 
—- 2g + r= 0 Oo —-2 1 #0 
29 + 2r = 30 0 2 2 30 
‘ a Multiply row 2 by —$ 
Multiply equation 2 by — 5 (—1R, + Ro) 
b+ g + r = 30 1 1 1 £30 
g + -1/2r = 0 01 -} 0 
29 + ar = 30 0 2 2. 30 
Replace equation 1 with the sum Replace row 1 with the sum of 
of (—1) times equation 2 plus (—1) times row 2 plus row 1 
equation 1; (—Rz + Ri > R:1); 
Replace equation 3 with the sum Replace row 3 with the sum of 
of (—2) times equation 2 plus (—2) times row 2 plus row 3 
equation 3 (—2R2 + R3 — R3) 
b+ Sy 2r = 30 10 2 30 
g -— 1/27 = 0 01 -} 0 
3r = 30 0 0 3. 30 


; 1 
Multiply equation 3 by + mee cory ‘i 3 
3R3 3 


1.2 


b + 3/2r = 30 
g — 1/2r = 0O 
r = 10 


Replace equation 2 with the sum 
Vas . 
of 5 times equation 3 plus 
equation 2; 
Replace equation 1 with the sum 
of — 3 times equation 3 plus 


equation 1 
b = 15 
g = 5 
r = 10 
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10 2 30 
01 -} O 
00 1 10 


Replace row 2 with the sum of $ 
times row 3 plus row 2 
($R3 + R2 — Ro); 
Replace row 1 with the sum of 
—3 times row 3 plus row 1 
(—3R3 + Ri > R1) 


The final matrix contains the same solution information as we have on the left in 
the form of equations. Recall that the first column of our matrices held the coefficients 
of the b variable; the second and third columns held the coefficients of the g and r 
variables, respectively. Therefore, the first row of the matrix can be interpreted as 
“b + Og + Or = 15,” or more concisely, “b = 15.” 

Let’s practice this manipulation again. 


Example 2 Find a solution to the following system of linear equations by si- 
multaneously manipulating the equations and the corresponding augmented matri- 
ces. 
xX + xX + x3 = O 
2X1 + 2X2 + X3 = 0 
—1x, + X2 = 2X3 — ae 
SOLUTION We'll first convert this system of equations into a matrix, then we’ll 


proceed by manipulating the system of equations (and hence the matrix) to find a 
solution. Again, there is not just one “right” way of proceeding; we’ll choose a method 
that is pretty efficient, but other methods certainly exist (and may be “better”!). The 
method use here, though, is a good one, and it is the method that we will be learning 


in the future. 


The given system and its corresponding augmented matrix are seen below. 


Original system of equations 


Xi + XxX + *%3 = O 
2X1 + 2x. + = x3 = 0 
—1x, + x —- 2x3 = 2 


Corresponding matrix 


We'll proceed by trying to get the x; out of the second and third equation. 
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Replace equation 2 with the sum 
of (—2) times equation 1 plus 
equation 2; 

Replace equation 3 with the sum 
of equation 1 and equation 3 


Xi + XX + %*% = O 
—-X3 = 0 
2X2 = X3 = 2 


Replace row 2 with the sum of 
(—2) times row 1 plus row 2 
(—2R1 + Ro — R2); 
Replace row 3 with the sum of 
row 1 and row 3 
(Ri + R3 > R3) 


11 1 £40 
0 0 -1 0 
0 2 -1 2 


Notice that the second equation no longer contains x2. We'll exchange the order of 
the equations so that we can follow the convention of solving for the second variable 


in the second equation. 


Interchange equations 2 and 3 


X + - + %&% j= 0 
2X? => X3 = 2 
—-X3 = 0 


Multiply equation 2 by 5 


Xi + X% + %*%3 = O 
X2 5X3 = 1 
—X3 = 0 


Multiply equation 3 by —1 


xX, t+ xX + x3 = O 
xX2  — 3X3 = 1 
X3 = 0 


Interchange rows 2 and 3 
Ro © R3 


11 1 £40 
0 2 -1 2 
0 0 -1 0 


Multiply row 2 by $ 


($R2 > R2) 
11 1 0 
01-5 1 
0 0 -1 O 
Multiply row 3 by —1 
(—1R3 — R3) 
11 1 0 
01-1 
00 1 40 


Notice that the last equation (and also the last row of the matrix) show that x3 = 0. 
Knowing this would allow us to simply eliminate the x3 from the first two equations. 
However, we will formally do this by manipulating the equations (and rows) as we have 


previously. 


Replace equation 1 with the sum 
of (—1) times equation 3 plus 
equation 1; 

Replace equation 2 with the sum 
of 5 times equation 3 plus 


equation 2 
Xi + = X2 = 0 
10 X2 = 1 
x34 = 0 


Replace row 1 with the sum of 
(—1) times row 3 plus row 1 
(—R3 + Ri > R1); 
Replace row 2 with the sum of ; 
times row 3 plus row 2 
(FR3 + R2—> R2) 


oOOR 
OrPFr 
ne) 
OrRO 
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Notice how the second equation shows that x2 = 1. All that remains to do is to 


solve for x1. 


Replace equation 1 with the sum 
of (—1) times equation 2 plus 
equation 1 


—1 
1 
x3 = 0 


Xt => 
x2 


Replace row 1 with the sum of 
(—1) times row 2 plus row 1 
(—Ro + Ri > R1) 


Obviously the equations on the left tell us that x; = —1, x2 = 1 and x3 = O, and 
notice how the matrix on the right tells us the same information. 


Exercises 1.2 


In Exercises 1 — 4, convert the given system of 
linear equations into an augmented matrix. 


3x + 4y + 52 = 7 
1-x + y -—- 32 = 1 
2x 2y 3z = 
2x + Sy — 62 = 2 
2 9x — 8z = 10 
—2x + 4y + 2z = -7 
Xi 3x2 = 4x3 a 5X4 = 17 
3 —xXi + 4x3 ae 8X4 => 1 
2X1 3x2 4x3 + 5X4 => 
3x1 _ 2X2 => 4 
4. 2X1 => 3 
—xXi + 9x2 => 8 


5x1 = 7X2 => 13 


In Exercises 5 — 9, convert the given aug- 
mented matrix into a system of linear equa- 
tions. Use the variables x, x2, etc. 


f1 2 3 
=: |-1 3 | 
[-3 4 7 
[0 <4 _) 
f1 1 -1 -1 2 
"(ea 3 5S 2 
1000 2 
g (0 2 oO 1 
“lo 010 5 
0001 3 


103107 2 
013 2 0 °5 


In Exercises 10 — 15, perform the given row 
operations on A, where 


9. 


ey 
A=|0 4 -2 
5 0 3 


10. —1R; > Ri 

11. R2 +> Rs 

12. Ri +R2 > Ro 
13. 2R, + R3 > Rs 
14. $R, > Ro 

15. —3Ri + R3 > Rs 


A matrix A is given below. In Exercises 16 — 
20, a matrix B is given. Give the row opera- 
tion that transforms A into B. 


16. B= 


17. B= 


18. B= 


PRPW PNP PNB 
NOW NPP NOR 
WRN WNP WNPB 


11 
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101 og, 2X + By = 2 
19. B=;1 1 1 “=x + y = 1 
1 2 3 
1 11 ax + 3y = 2 
20.B=|]1 0 1 PtP oie ie. ee ae, 
0 2 2 
In Exercises 21 — 26, rewrite the system of 5 5 = 
equations in matrix form. Find the solution to =e ia man 
; : 2 25. X2 = 1 
the linear system by simultaneously manipu- 3 = % 
lating the equations and the matrix. Tan = 
a2. FS 
“2x -— 3y = 1 — 5x. + 2x3 = -11 
26. Xi ae 2X3 => 15 
= 3x2 ae X3 = —8 
2x + 4y = 10 
22. 
—y —+- y = 4 
1.3. Elementary Row Operations and Gaussian Elimina- 


tion 


AS YOU READ 


1. 


2. 


Give two reasons why the Elementary Row Operations are called “Elementary.” 


T/F: Assuming a solution exists, all linear systems of equations can be solved 
using only elementary row operations. 


. Give one reason why one might not be interested in putting a matrix into re- 


duced row echelon form. 


Identify the leading 1s in the following matrix: 


100 1 
01410 
001 1 
000 0 


. Using the “forward” and “backward” steps of Gaussian elimination creates lots 


of making computations easier. 


In our examples thus far, we have essentially used just three types of manipulations 
in order to find solutions to our systems of equations. These three manipulations are: 


1. 
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Add a scalar multiple of one equation to a second equation, and replace the 
second equation with that sum 
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2. Multiply one equation by a nonzero scalar 


3. Swap the position of two equations in our list 


We saw earlier how we could write all the information of a system of equations in 
a matrix, so it makes sense that we can perform similar operations on matrices (as we 
have done before). Again, simply replace the word “equation” above with the word 
“row.” 

We didn’t justify our ability to manipulate our equations in the above three ways; 
it seems rather obvious that we should be able to do that. In that sense, these oper- 
ations are “elementary.” These operations are elementary in another sense; they are 
fundamental — they form the basis for much of what we will do in matrix algebra. Since 


these operations are so important, we list them again here in the context of matrices. 


Key Idea 1 Elementary Row Operations 


1. Add a scalar multiple of one row to another row, and 
replace the latter row with that sum 


. Multiply one row by a nonzero scalar 


. Swap the position of two rows 


Given any system of linear equations, we can find a solution (if one exists) by using 
these three row operations. Elementary row operations give us a new linear system, 
but the solution to the new system is the same as the old. We can use these opera- 
tions as much as we want and not change the solution. This brings to mind two good 
questions: 


1. Since we can use these operations as much as we want, how do we know when 
to stop? (Where are we supposed to “go” with these operations?) 


2. Is there an efficient way of using these operations? (How do we get “there” the 
fastest?) 


We’ll answer the first question first. Most of the time? we will want to take our 
original matrix and, using the elementary row operations, put it into something called 
reduced row echelon form.” This is our “destination,” for this form allows us to readily 
identify whether or not a solution exists, and in the case that it does, what that solution 
is. 

In the previous section, when we manipulated matrices to find solutions, we were 
unwittingly putting the matrix into reduced row echelon form. However, not all solu- 
tions come in such a simple manner as we’ve seen so far. Putting a matrix into reduced 


lunless one prefers obfuscation to clarification 
2Some texts use the term reduced echelon form instead. 
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row echelon form helps us identify all types of solutions. We’ll explore the topic of un- 
derstanding what the reduced row echelon form of a matrix tells us in the following 
sections; in this section we focus on finding it. 


Definition 3 Reduced Row Echelon Form 


A matrix is in reduced row echelon form if its entries satisfy 
the following conditions. 


1. The first nonzero entry in each rowis a 1 (called a /ead- 
ing 1). 


. Each leading 1 comes in a column to the right of the 


leading 1s in rows above it. 
. All rows of all Os come at the bottom of the matrix. 


. Ifacolumn contains a leading 1, then all other entries 
in that column are O. 


A matrix that satisfies the first three conditions is said to be 
in row echelon form. 


Example 3 Which of the following matrices is in reduced row echelon form? 
1 0 101 
a) e | P) E 1 | 
0 0 110 
af °| ai | 0 | 
1001 12 00 
e)}0 0 0 0 f);}0 0 3 0 
001 3 000 4 
0123 0 4 1 1 0 
g)}0 0001 5 h)|}O 1 0 
00000 0 001 
SOLUTION The matrices in a), b), c), d) and g) are all in reduced row echelon 


form. Check to see that each satisfies the necessary conditions. If your instincts were 
wrong on some of these, correct your thinking accordingly. 

The matrix in e) is not in reduced row echelon form since the row of all zeros is 
not at the bottom. The matrix in f) is not in reduced row echelon form since the first 
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nonzero entries in rows 2 and 3 are not 1. Finally, the matrix in h) is not in reduced 
row echelon form since the first entry in column 2 is not zero; the second 1 in column 
2 is a leading one, hence all other entries in that column should be 0. 

We end this example with a preview of what we'll learn in the future. Consider 
the matrix in b). If this matrix came from the augmented matrix of a system of linear 
equations, then we can readily recognize that the solution of the system is x; = 1 and 
Xz = 2. Again, in previous examples, when we found the solution to a linear system, 
we were unwittingly putting our matrices into reduced row echelon form. 


We began this section discussing how we can manipulate the entries in a matrix 
with elementary row operations. This led to two questions, “Where do we go?” and 
“How do we get there quickly?” We’ve just answered the first question: most of the 
time we are “going to” reduced row echelon form. We now address the second ques- 
tion. 

There is no one “right” way of using these operations to transform a matrix into 
reduced row echelon form. However, there is a general technique that works very well 
in that it is very efficient (so we don’t waste time on unnecessary steps). This technique 
is called Gaussian elimination. \t is named in honor of the great mathematician Karl 
Friedrich Gauss. 

While this technique isn’t very difficult to use, it is one of those things that is eas- 
ier understood by watching it being used than explained as a series of steps. With this 
in mind, we will go through one more example highlighting important steps and then 
we'll explain the procedure in detail. 


Example 4 Put the augmented matrix of the following system of linear equa- 
tions into reduced row echelon form. 


—3x, — 3x. + 9x3 = 12 
2x, + 22x. -—- 4%3 = -2 
—2x> = 4x3 = -8 
SOLUTION We start by converting the linear system into an augmented ma- 


trix. 


9-4. e ae 
a a 
GO a ae, 8 


Our next step is to change the entry in the box to a 1. To do this, let’s multiply row 
1 by —}. 


1 1 -3 -4 
0 -2 -4 -8 
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We have now created a /eading 1; that is, the first entry in the first row is a 1. 
Our next step is to put zeros under this 1. To do this, we’ll use the elementary row 
operation given below. 


1 1 -3 -4 
—2Ri + R2 > Ro (@) 0 2 6 
0 -—2 -4 -8 


Once this is accomplished, we shift our focus from the leading one down one row, 
and to the right one column, to the position that is boxed. We again want to put a 1 
in this position. We can use any elementary row operations, but we need to restrict 
ourselves to using only the second row and any rows below it. Probably the simplest 
thing we can do is interchange rows 2 and 3, and then scale the new second row so 
that there is a 1 in the desired position. 


1 1 —3 -4 
R2 > R3 O |-2| -—4 -8 


1 1-3 —4 
—3R, — R2 01 2 4 
0 0 |2 6 


We have now created another leading 1, this time in the second row. Our next 
desire is to put zeros underneath it, but this has already been accomplished by our 
previous steps. Therefore we again shift our attention to the right one column and 
down one row, to the next position put in the box. We want that to be a 1. A simple 
scaling will accomplish this. 


11-3 -4 
5R3 — R3 01 2 4 
00 1 3 


This ends what we will refer to as the forward steps. Our next task is to use the 
elementary row operations and go back and put zeros above our leading 1s. This is 
referred to as the backward steps. These steps are given below. 


3R3 + Ri +R Lis > 
3 1 1 

010 -2 
—2R R R 

3 + Ro — Ra 001 3 

100 7 

—R2 +R1 > Ri 01 0 -2 

001 3 

It is now easy to read off the solution as x; = 7, x» = —2 and x3 = 3. 
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We now formally explain the procedure used to find the solution above. As you 
read through the procedure, follow along with the example above so that the expla- 
nation makes more sense. 


Forward Steps 


1. Working from left to right, consider the first column that isn’t all zeros that hasn’t 
already been worked on. Then working from top to bottom, consider the first 
row that hasn’t been worked on. 


2. If the entry in the row and column that we are considering is zero, interchange 
rows with a row below the current row so that that entry is nonzero. If all entries 
below are zero, we are done with this column; start again at step 1. 


3. Multiply the current row by a scalar to make its first entry a 1 (a leading 1). 


4. Repeatedly use Elementary Row Operation 1 to put zeros underneath the lead- 
ing one. 


5. Go back to step 1 and work on the new rows and columns until either all rows 
or columns have been worked on. 


If the above steps have been followed properly, then the following should be true 
about the current state of the matrix: 


1. The first nonzero entry in each row is a 1 (a leading 1). 
2. Each leading 1 is in a column to the right of the leading 1s above it. 


3. All rows of all zeros come at the bottom of the matrix. 


Note that this means we have just put a matrix into row echelon form. The next 
steps finish the conversion into reduced row echelon form. These next steps are re- 
ferred to as the backward steps. These are much easier to state. 


Backward Steps 


1. Starting from the right and working left, use Elementary Row Operation 1 re- 
peatedly to put zeros above each leading 1. 


The basic method of Gaussian elimination is this: create leading ones and then use 
elementary row operations to put zeros above and below these leading ones. We can 
do this in any order we please, but by following the “Forward Steps” and “Backward 
Steps,” we make use of the presence of zeros to make the overall computations easier. 
This method is very efficient, so it gets its own name (which we’ve already been using). 
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Definition 4 Gaussian Elimination 


Gaussian elimination is the technique for finding the re- 
duced row echelon form of a matrix using the above pro- 
cedure. It can be abbreviated to: 


. Create a leading 1. 


. Use this leading 1 to put zeros underneath it. 


. Repeat the above steps until all possible rows have 
leading 1s. 


. Put zeros above these leading 1s. 


Let’s practice some more. 


Example 5 Use Gaussian elimination to put the matrix A into reduced row ech- 
elon form, where 


SOLUTION We start by wanting to make the entry in the first column and first 
row a 1 (a leading 1). To do this we’ll scale the first row by a factor of —}. 


121 £°5 0 
—3R, > Ri 241 9 -2 
3 6 1 13 -4 


Next we need to put zeros in the column below this newly formed leading 1. 
1 2 1 5 0 


Oo j;O| -1 -1 -2 
0 O -2 -2 -4 


—2R1 + R2 > R2 
—3R, + R3 > R3 


Our attention now shifts to the right one column and down one row to the position 
indicated by the box. We want to put a 1 in that position. Our only options are to either 
scale the current row or to interchange rows with a row below it. However, in this case 
neither of these options will accomplish our goal. Therefore, we shift our attention to 
the right one more column. 

We want to put a 1 where there is a—1. A simple scaling will accomplish this; once 
done, we will put a 0 underneath this leading one. 


1 2 1 5 0 
—R2 — R2 0 0 1 1 2 
0 0 -2 -2 -—4 
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121 5 O 
2R2 + R3 — R3 001 1 2 
0 0 0 |0} O 


Our attention now shifts over one more column and down one row to the position 
indicated by the box; we wish to make this a 1. Of course, there is no way to do this, 
so we are done with the forward steps. 

Our next goal is to put a O above each of the leading 1s (in this case there is only 
one leading 1 to deal with). 


1204 -2 
—R2 + Ri > Ri 001 1 2 
0000 0 
This final matrix is in reduced row echelon form. 
Example 6 Put the matrix 
121 3 
21141 
3 32 1 
into reduced row echelon form. 
SOLUTION Here we will show all steps without explaining each one. 
2R R2 > R 7 5 . 
es 17 AN2 2 
—3R, + R3 > Rs St 3 


0 -3 -1 -8 


1 2 1 3 
RoR 0 1 1/3 5/3 
Oo -3 -1 -8 


12 1 3 

3R2 + R3 — R3 0 1 1/3 5/3 

00 0 -3 

12 1 3 

—3R3 + R3 0 1 1/3 5/3 

00 O 1 

spinon ETL 
—2R3+R2 OR 

ee 000 1 

1 0 1/3 0 

—2R, + Ri > Ri 0 1 1/3 O 

00 0 1 
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The last matrix in the above example is in reduced row echelon form. If one thinks 
of the original matrix as representing the augmented matrix of a system of linear equa- 
tions, this final result is interesting. What does it mean to have a leading one in the 
last column? We'll figure this out in the next section. 


Example 7 Put the matrix A into reduced row echelon form, where 


SOLUTION We'll again show the steps without explanation, although we will 
stop at the end of the forward steps and make a comment. 


i ajo a2. 2 


$Ri > Ri 1 —1 2 12 
2 2 —1 9 
pete ¢ we ste: 2 
ANY 2 2 
0 3/2 5/2 10 
—2R R R 
1+ R3 — R3 0 1 0 5 
i. 47 <2 9 
—2Ry + Ry 0 1 5/3 —20/3 
Oo 41 0 5 
i ae <1 2 
0 0 5/3 35/3 
i, 47> => 2 
se 0 1 5/3 —20/3 
0 O 1 7 


Let’s take a break here and think about the state of our linear system at this mo- 
ment. Converting back to linear equations, we now know 


Xy +1/2x, —1/2x3 = 2 
X2 — 5/3x3 = —20/3 . 
x3 0 = 7 


Since we know that x3 = 7, the second equation turns into 
x2 — (5/3)(7) = —20/3, 


telling us that x. = 5. 
Finally, knowing values for x, and x3 lets us substitute in the first equation and find 


x1 + (1/2)(5) — (1/2)(7) = 2, 
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SOX; = 3. 


This process of substituting known values back into other equations is called back 
substitution. This process is essentially what happens when we perform the backward 
steps of Gaussian elimination. We make note of this below as we finish out finding the 
reduced row echelon form of our matrix. 


2R3 + Ro — Ro 1 1/2 —1/2 2 

(knowing x3 = 7 allows us Oo 1 0) 5 

to find x. = 5) 0 8600 1 7 
Timor e 
(inowing a = 5 and - =7 ee 
0041 7 


allows us to find x; = 3) 


We did our operations slightly “out of order” in that we didn’t put the zeros above 
our leading 1 in the third column in the same step, highlighting how back substitution 
works. 


In all of our practice, we’ve only encountered systems of linear equations with ex- 
actly one solution. Is this always going to be the case? Could we ever have systems 
with more than one solution? If so, how many solutions could there be? Could we 
have systems without a solution? These are some of the questions we’ll address in 
the next section. 


Exercises 1.3 


In Exercises 1 — 4, state whether or not the 1 0 0 
given matrices are in reduced row echelon (c) }O O 1 
form. If it is not, state why. 0 0 0 
- - 100 —-5 
1. (a) |? 1 () [7 | (d) }o 10 7 
[oO 4 [i 1 001 3 
fo 1 fl o 1 
) 4 sl yo 4 | ao Be 
_ . 4. (a) |O 2 O 
1 00 0 0 O 
0 02 2 
a Me) lo Oo | (c) a a 
fa. <@:-4 fo 0 O Poa 
(b) 0 4 | (d) -— 4 (b) |}O0 O 1 O 
L L 0 0 0 0 
1141 0 01 —-5 
3 (a) |]O 1 1 (c) |O O O 
0 0 1 0 0 0 
100 1100141 
(b) |O 1 O (d) }O0 O 10 1 1 
0 0 0 00010 0 
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In Exercises 5 — 22, use Gaussian Elimination 
to put the given matrix into reduced row ech- 


elon form. 
fi 2 
Balle 2 
. 
2 -2 
6. | = 
s 
4 12 
| =| 
a 
—-5 7 
. | 10 a 
. 
—-1 1 4 
a |-2 1 1] 
- 
7 2 3 
13 4 | 
7 
3 —3 6 
a re =] 
. 
4 5 —6 
ae (ee ‘a 
—2 -4 -8 
13. —2 -3 —-5 
2 3 6 
2 11 
14. 111 
2 1 2 


15. 


16. 


17. 


18. 


19. 


20. 


1.4 Existence and Uniqueness of Solutions 


AS YOU READ 


1 


. T/F: It is possible for a linear system to have exactly 5 solutions. 


T/F: A variable that corresponds to a leading 1 is “free.” 
How can one tell what kind of solution a linear system of equations has? 


Give an example (different from those given in the text) of a 2 equation, 2 un- 
known linear system that is not consistent. 


T/F: A particular solution for a linear system with infinite solutions can be found 


by arbitrarily picking values for the free variables. 


1.4 Existence and Uniqueness of Solutions 


So far, whenever we have solved a system of linear equations, we have always 
found exactly one solution. This is not always the case; we will find in this section that 
some systems do not have a solution, and others have more than one. 

We start with a very simple example. Consider the following linear system: 


x-—y=0. 


There are obviously infinite solutions to this system; as long as x = y, we have a so- 
lution. We can picture all of these solutions by thinking of the graph of the equation 
y = x on the traditional x, y coordinate plane. 

Let’s continue this visual aspect of considering solutions to linear systems. Con- 
sider the system 


x+ty=2 
x-—y=0. 


Each of these equations can be viewed as lines in the coordinate plane, and since their 
slopes are different, we know they will intersect somewhere (see Figure 1.1 (a)). In 
this example, they intersect at the point (1,1) — that is, when x = 1 and y = 1, both 
equations are satisfied and we have a solution to our linear system. Since this is the 
only place the two lines intersect, this is the only solution. 

Now consider the linear system 


xt+ty=1 
2x + 2y = 2. 


It is clear that while we have two equations, they are essentially the same equation; 
the second is just a multiple of the first. Therefore, when we graph the two equations, 
we are graphing the same line twice (see Figure 1.1 (b); the thicker line is used to 
represent drawing the line twice). In this case, we have an infinite solution set, just as 
if we only had the one equation x + y = 1. We often write the solution as x = 1—yto 
demonstrate that y can be any real number, and x is determined once we pick a value 
for y. 


(a) (b) (c) 


Figure 1.1: The three possibilities for two linear equations with two unknowns. 
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Finally, consider the linear system 


xXt+ty=1 


x+y=2. 


We should immediately spot a problem with this system; if the sum of x and y is 1, 
how can it also be 2? There is no solution to such a problem; this linear system has no 
solution. We can visualize this situation in Figure 1.1 (c); the two lines are parallel and 
never intersect. 

If we were to consider a linear system with three equations and two unknowns, we 
could visualize the solution by graphing the corresponding three lines. We can picture 
that perhaps all three lines would meet at one point, giving exactly 1 solution; per- 
haps all three equations describe the same line, giving an infinite number of solutions; 
perhaps we have different lines, but they do not all meet at the same point, giving 
no solution. We further visualize similar situations with, say, 20 equations with two 
variables. 

While it becomes harder to visualize when we add variables, no matter how many 
equations and variables we have, solutions to linear equations always come in one of 
three forms: exactly one solution, infinite solutions, or no solution. This is a fact that 
we will not prove here, but it deserves to be stated. 


Theorem 1 Solution Forms of Linear Systems 


Every linear system of equations has exactly one solution, 
infinite solutions, or no solution. 


This leads us to a definition. Here we don’t differentiate between having one so- 
lution and infinite solutions, but rather just whether or not a solution exists. 


Definition 5 Consistent and Inconsistent Linear Systems 


A system of linear equations is consistent if it has a solution 
(perhaps more than one). A linear system is inconsistent if 
it does not have a solution. 


How can we tell what kind of solution (if one exists) a given system of linear equa- 
tions has? The answer to this question lies with properly understanding the reduced 
row echelon form of a matrix. To discover what the solution is to a linear system, we 
first put the matrix into reduced row echelon form and then interpret that form prop- 
erly. 

Before we start with a simple example, let us make a note about finding the re- 


1.4 Existence and Uniqueness of Solutions 


duced row echelon form of a matrix. 


Technology Note: In the previous section, we learned how to find the reduced row 
echelon form of a matrix using Gaussian elimination — by hand. We need to know how 
to do this; understanding the process has benefits. However, actually executing the 
process by hand for every problem is not usually beneficial. In fact, with large systems, 
computing the reduced row echelon form by hand is effectively impossible. Our main 
concern is what “the rref” is, not what exact steps were used to arrive there. Therefore, 
the reader is encouraged to employ some form of technology to find the reduced row 
echelon form. Computer programs such as Mathematica, MATLAB, Maple, and Derive 
can be used; many handheld calculators (such as Texas Instruments calculators) will 
perform these calculations very quickly. 

As a general rule, when we are learning a new technique, it is best to not use 
technology to aid us. This helps us learn not only the technique but some of its “inner 
workings.” We can then use technology once we have mastered the technique and are 
now learning how to use it to solve problems. 

From here on out, in our examples, when we need the reduced row echelon form 
of a matrix, we will not show the steps involved. Rather, we will give the initial matrix, 
then immediately give the reduced row echelon form of the matrix. We trust that the 
reader can verify the accuracy of this form by both performing the necessary steps by 
hand or utilizing some technology to do it for them. 


Our first example explores officially a quick example used in the introduction of 
this section. 


Example 8 Find the solution to the linear system 
Xx + X% = 1 
2X, + 2x = 2° 
SOLUTION Create the corresponding augmented matrix, and then put the ma- 


trix into reduced row echelon form. 


111 rer 111 
22 2 0 0 0 
Now convert the reduced matrix back into equations. In this case, we only have 
one equation, 


Xi +xX=1 
or, equivalently, 

X, =1-x 

X2 is free. 


We have just introduced a new term, the word free. It is used to stress that idea 
that x2 can take on any value; we are “free” to choose any value for x2. Once this value 
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is chosen, the value of x; is determined. We have infinite choices for the value of x, 
so therefore we have infinite solutions. 
For example, if we set x2 = O, then x, = 1; if we set x2 = 5, then x, = —4. 


Let’s try another example, one that uses more variables. 


Example 9 Find the solution to the linear system 
X2 — X3 = 3 
X =F 2x3 = 2 
—3x> +r 3x3 = -9 
SOLUTION To find the solution, put the corresponding matrix into reduced 


row echelon form. 


0 1 -1 3 10 2 2 
— 

1 O 2 2 rref 01 -1 3 

0 -3 3 + -9 00 0 0 


Now convert this reduced matrix back into equations. We have 


X1 + 2x3 = 2 


X2 — X3 = 3 
or, equivalently, 


X, = 2 — 2x3 
XX. =3+x3 


X3 is free. 


These two equations tell us that the values of x; and x7 depend on what x3 is. As 
we saw before, there is no restriction on what x3 must be; it is “free” to take on the 
value of any real number. Once x3 is chosen, we have a solution. Since we have infinite 
choices for the value of x3, we have infinite solutions. 

As examples, x; = 2, x2 = 3, x3 = Ois one solution; x; = —2, x2 = 5, x3 = 2 is 
another solution. Try plugging these values back into the original equations to verify 
that these indeed are solutions. (By the way, since infinite solutions exist, this system 
of equations is consistent.) 


In the two previous examples we have used the word “free” to describe certain 
variables. What exactly is a free variable? How do we recognize which variables are 
free and which are not? 

Look back to the reduced matrix in Example 8. Notice that there is only one leading 
1 in that matrix, and that leading 1 corresponded to the x, variable. That told us that 
X, was not a free variable; since x, did not correspond to a leading 1, it was a free 
variable. 


1.4 Existence and Uniqueness of Solutions 


Look also at the reduced matrix in Example 9. There were two leading 1s in that 
matrix; one corresponded to x; and the other to x. This meant that x; and x2 were 
not free variables; since there was not a leading 1 that corresponded to x3, it was a 
free variable. 


We formally define this and a few other terms in this following definition. 


Definition 6 Dependent and Independent Variables 


Consider the reduced row echelon form of an augmented 
matrix of a linear system of equations. Then: 


a variable that corresponds to a leading 1 is a basic, or 
dependent, variable, and 


a variable that does not correspond to a leading 1 is a free, 
or independent, variable. 


One can probably see that “free” and “independent” are relatively synonymous. It 
follows that if a variable is not independent, it must be dependent; the word “basic” 
comes from connections to other areas of mathematics that we won’t explore here. 

These definitions help us understand when a consistent system of linear equations 
will have infinite solutions. If there are no free variables, then there is exactly one 
solution; if there are any free variables, there are infinite solutions. 


Key Idea 2 Consistent Solution Types 


A consistent linear system of equations will have exactly 
one solution if and only if there is a leading 1 for each 
variable in the system. 


If a consistent linear system of equations has a free variable, 
it has infinite solutions. 


If a consistent linear system has more variables than leading 
1s, then the system will have infinite solutions. 


A consistent linear system with more variables than equa- 
tions will always have infinite solutions. 


Note: Key Idea 2 applies only to consistent systems. If a system is inconsistent, 
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then no solution exists and talking about free and basic variables is meaningless. 


When a consistent system has only one solution, each equation that comes from 
the reduced row echelon form of the corresponding augmented matrix will contain 
exactly one variable. If the consistent system has infinite solutions, then there will be 
at least one equation coming from the reduced row echelon form that contains more 
than one variable. The “first” variable will be the basic (or dependent) variable; all 
others will be free variables. 

We have now seen examples of consistent systems with exactly one solution and 
others with infinite solutions. How will we recognize that a system is inconsistent? 
Let’s find out through an example. 


Example 10 Find the solution to the linear system 
Xi + xX. + *% = 1 
Xx + 2x. + x3 = 2. 
2X1 +r 3x + 2x3 = 0 
SOLUTION We start by putting the corresponding matrix into reduced row 


echelon form. 


Now let us take the reduced matrix and write out the corresponding equations. 
The first two rows give us the equations 


X, + x3 =0 


X2 = 0. 


So far, so good. However the last row gives us the equation 
Ox, + Ox> + Ox3 =1 


or, more concisely, 0 = 1. Obviously, this is not true; we have reached a contradiction. 
Therefore, no solution exists; this system is inconsistent. 


In previous sections we have only encountered linear systems with unique solu- 
tions (exactly one solution). Now we have seen three more examples with different 
solution types. The first two examples in this section had infinite solutions, and the 
third had no solution. How can we tell if a system is inconsistent? 

A linear system will be inconsistent only when it implies that 0 equals 1. We can 
tell if a linear system implies this by putting its corresponding augmented matrix into 
reduced row echelon form. If we have any row where all entries are 0 except for the 
entry in the last column, then the system implies O=1. More succinctly, if we have a 
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leading 1 in the last column of an augmented matrix, then the linear system has no 
solution. 


Key Idea 3 Inconsistent Systems of Linear Equations 


A system of linear equations is inconsistent if the reduced 


row echelon form of its corresponding augmented matrix 
has a leading 1 in the last column. 


Example 11 Confirm that the linear system 
x + y = 0 
2x + 22 = 4 


has no solution. 


SOLUTION We can verify that this system has no solution in two ways. First, 
let’s just think about it. If x+y = 0, then it stands to reason, by multiplying both sides 
of this equation by 2, that 2x + 2y = 0. However, the second equation of our system 
says that 2x + 2y = 4. Since 0 # 4, we have a contradiction and hence our system has 
no solution. (We cannot possibly pick values for x and y so that 2x + 2y equals both 0 
and 4.) 


Now let us confirm this using the prescribed technique from above. The reduced 
row echelon form of the corresponding augmented matrix is 


1 1 0 
0 0 1}° 
We have a leading 1 in the last column, so therefore the system is inconsistent. 


Let’s summarize what we have learned up to this point. Consider the reduced row 
echelon form of the augmented matrix of a system of linear equations. If there is a 
leading 1 in the last column, the system has no solution. Otherwise, if there is a leading 
1 for each variable, then there is exactly one solution; otherwise (i.e., there are free 
variables) there are infinite solutions. 


Systems with exactly one solution or no solution are the easiest to deal with; sys- 
tems with infinite solutions are a bit harder to deal with. Therefore, we’ll do a little 
more practice. First, a definition: if there are infinite solutions, what do we call one of 
those infinite solutions? 


3That sure seems like a mouthful in and of itself. However, it boils down to “look at the reduced form of 
the usual matrix.” 
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Definition 7 Particular Solution 


Consider a linear system of equations with infinite solutions. 
A particular solution is one solution out of the infinite set of 
possible solutions. 


The easiest way to find a particular solution is to pick values for the free variables 
which then determines the values of the dependent variables. Again, more practice is 
called for. 


Example 12 Give the solution to a linear system whose augmented matrix in 
reduced row echelon form is 


and give two particular solutions. 


SOLUTION We can essentially ignore the third row; it does not divulge any 
information about the solution.’ The first and second rows can be rewritten as the 
following equations: 


X1 —X2 + 2x, = 4 


X3 — 3X, = 7. 


Notice how the variables x, and x3 correspond to the leading 1s of the given matrix. 
Therefore x; and x3 are dependent variables; all other variables (in this case, x2 and 
Xq) are free variables. 

We generally write our solution with the dependent variables on the left and inde- 
pendent variables and constants on the right. It is also a good practice to acknowledge 
the fact that our free variables are, in fact, free. So our final solution would look some- 
thing like 


xX, =44+xX2 — 2x4 


X2 is free 


X3 = 7+ 3X, 


X, is free. 


To find particular solutions, choose values for our free variables. There is no “right” 
way of doing this; we are “free” to choose whatever we wish. 


4Then why include it? Rows of zeros sometimes appear “unexpectedly” in matrices after they have been 
put in reduced row echelon form. When this happens, we do learn something; it means that at least one 
equation was a combination of some of the others. 
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By setting x. = O = xq, we have the solution x; 4, X2 0, x3 7, X4 0. 


By setting x7 = 1 and x, = —5, we have the solution x; = 15, x7 = 1, x3 = —8, 
Xq = —5. It is easier to read this when are variables are listed vertically, so we repeat 
these solutions: 
One particular solution is: Another particular solution is: 
xy = 4 xy = 15 
x2 = 0 x2 = 1 
x3 = 7 x3 = —8 
x4 = 0. x4 = —5. 
Example 13 Find the solution to a linear system whose augmented matrix in 


reduced row echelon form is 


100 2 3 
0104 °5 


and give two particular solutions. 


SOLUTION Converting the two rows into equations we have 


Xo 2X4 =3 


X2 + 4x, = 5. 


We see that x; and x2 are our dependent variables, for they correspond to the 
leading 1s. Therefore, x3 and x, are independent variables. This situation feels a little 
unusual,” for x3 doesn’t appear in any of the equations above, but cannot overlook it; 
it is still a free variable since there is not a leading 1 that corresponds to it. We write 
our solution as: 


xy = 3- 2X4 
X2 =5—-—4x, 
X3 is free 


Xq is free. 


To find two particular solutions, we pick values for our free variables. Again, there 
is no “right” way of doing this (in fact, there are ... infinite ways of doing this) so we 
give only an example here. 


5What kind of situation would lead to a column of all zeros? To have such a column, the original ma- 
trix needed to have a column of all zeros, meaning that while we acknowledged the existence of a certain 
variable, we never actually used it in any equation. In practical terms, we could respond by removing the 
corresponding column from the matrix and just keep in mind that that variable is free. In very large systems, 
it might be hard to determine whether or not a variable is actually used and one would not worry about it. 

When we learn about eigenvectors and eigenvalues, we will see that under certain circumstances this 
situation arises. In those cases we leave the variable in the system just to remind ourselves that it is there. 
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One particular solution is: Another particular solution is: 
xy = 3 xi = 3-27 
x%2=5 X»=5-47 
x3 = 1000 xX, =e? 

X4 = 0. xX4 = T. 


|’ 


(In the second particular solution we picked “unusua 
light the fact that we can.) 


values for x3 and x, just to high- 


Example 14 Find the solution to the linear system 
X41 + X2 + X3 = 5 
Xy — X2 tr X = 3 


and give two particular solutions. 


SOLUTION The corresponding augmented matrix and its reduced row echelon 
form are given below. 


1115 = 1014 
i oat a 3 oe 0101 


Converting these two rows into equations, we have 


X,+x3=4 
x2 = 1 
giving us the solution 
xX, = 4-x3 
x2 = 1 
X3 is free. 


|” 


Once again, we get a bit of an “unusual” solution; while x2 is a dependent variable, 
it does not depend on any free variable; instead, it is always 1. (We can think of it 
as depending on the value of 1.) By picking two values for x3, we get two particular 


solutions. 
One particular solution is: Another particular solution is: 
y= 4 xy = 3 
x2= 1 x2 = 1 
RB 0. x3 = 1. 
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The constants and coefficients of a matrix work together to determine whether a 
given system of linear equations has one, infinite, or no solution. The concept will be 
fleshed out more in later chapters, but in short, the coefficients determine whether a 
matrix will have exactly one solution or not. In the “or not” case, the constants deter- 
mine whether or not infinite solutions or no solution exists. (So if a given linear system 
has exactly one solution, it will always have exactly one solution even if the constants 
are changed.) Let’s look at an example to get an idea of how the values of constants 
and coefficients work together to determine the solution type. 


Example 15 For what values of k will the given system have exactly one solu- 
tion, infinite solutions, or no solution? 


Xi + 2x = 3 
3x1 + kx» = 9 
SOLUTION We answer this question by forming the augmented matrix and 


starting the process of putting it into reduced row echelon form. Below we see the 
augmented matrix and one elementary row operation that starts the Gaussian elimi- 
nation process. 


i243) eee Te 2. 3 
E k | oe earns E k-9 | 


This is as far as we need to go. In looking at the second row, we see that if k = 9, 
then that row contains only zeros and x2 is a free variable; we have infinite solutions. 
If k 4 9, then our next step would be to make that second row, second column en- 
try a leading one. We don’t particularly care about the solution, only that we would 
have exactly one as both x; and x2 would correspond to a leading one and hence be 
dependent variables. 

Our final analysis is then this. If k 4 9, there is exactly one solution; if k = 9, there 
are infinite solutions. In this example, it is not possible to have no solutions. 


As an extension of the previous example, consider the similar augmented matrix 
where the constant 9 is replaced with a 10. Performing the same elementary row 
operation gives 


12 3 ee ee a oe 
E k | eR Re oP E k-9 ale 


As in the previous example, if k 4 9, we can make the second row, second column 
entry a leading one and hence we have one solution. However, if k = 9, then our last 
row is [0 0 1], meaning we have no solution. 


We have been studying the solutions to linear systems mostly in an “academic” 
setting; we have been solving systems for the sake of solving systems. In the next sec- 


tion, we'll look at situations which create linear systems that need solving (i.e., “word 
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problems”). 


Exercises 1.4 


In Exercises 1 — 14, find the solution to the 
given linear system. If the system has infinite 


solutions, give 2 particular solutions. 


1 2X1 ry 4x2 => 2 
: Xi oa 2X2 = 1 
2 —X1 + 5x2 = 3 
: 2X1 = 10x = -6 
X1 + X2 = 
3. 
2X1 + X2 = 
4 3x1 7X2 => 7 
: 2X1 = 8x2 => 8 
5 2X1 + 3x2 => 1 
: 2X1 3x2 = 1 
6 Xt + 2X2 = 1 
: —X1 = 2X2 => 5 
7 —2x, + 4x2 + 4x3 
: X1 — 3x2 + 2X3 
8 X1 2X2 2x3 = 
. 2X1 + 5x2 +r X3 = 
9 Xi X2 + x3 Xa => 0 
. —2x1 = 2X2 + X3 = —1 
X1 + X2 + 6x3 +9x, = O 
10. 
X1 X3 2X4 = 


12. 


13. 


14. 


1x1 
—3x1 


X2 2x. 3 => 0 

X2 3x: 3 = of 

2x D: 5x: 3 = 3 

3x 2 ca 3x: 3 => 1 

X2 2x: 3 = 1 

5x 2 8x: 3 => 2 

2x 2 + 2x 3 => 1 

X2 + 3x: 3 => dl 

3x 2 + 5x: 3 => Zz 
i i 4x. 2 oi 6x: 3 => 2 
+r 2x: 2 + 3x: 3 = 1 
= 6x. 2 — 9x: 3 => —3 


In Exercises 15 — 18, state for which values 
of k the given system will have exactly 1 solu- 
tion, infinite solutions, or no solution. 


15. 


16. 


17. 


18. 
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AS YOU READ 


1. How do most problems appear “in the real world?” 


2. The unknowns in a problem are also called what? 


1 
k 


3. How many points are needed to determine the coefficients of a 5” degree poly- 


nomial? 


We’ve started this chapter by addressing the issue of finding the solution to a sys- 
tem of linear equations. In subsequent sections, we defined matrices to store linear 
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equation information; we described how we can manipulate matrices without chang- 
ing the solutions; we described how to efficiently manipulate matrices so that a work- 
ing solution can be easily found. 

We shouldn’t lose sight of the fact that our work in the previous sections was aimed 
at finding solutions to systems of linear equations. In this section, we’ll learn how to 
apply what we’ve learned to actually solve some problems. 

Many, many, many problems that are addressed by engineers, businesspeople, 
scientists and mathematicians can be solved by properly setting up systems of linear 
equations. In this section we highlight only a few of the wide variety of problems that 
matrix algebra can help us solve. 

We start with a simple example. 


Example 16 A jar contains 100 blue, green, red and yellow marbles. There are 
twice as many yellow marbles as blue; there are 10 more blue marbles than red; the 
sum of the red and yellow marbles is the same as the sum of the blue and green. How 
many marbles of each color are there? 


SOLUTION Let’s call the number of blue balls b, and the number of the other 
balls g, rand y, each representing the obvious. Since we know that we have 100 mar- 
bles, we have the equation 


b+g+r+y= 100. 


The next sentence in our problem statement allows us to create three more equations. 


We are told that there are twice as many yellow marbles as blue. One of the fol- 
lowing two equations is correct, based on this statement; which one is it? 


2y=b or 2b=y 


The first equation says that if we take the number of yellow marbles, then double 
it, we’ll have the number of blue marbles. That is not what we were told. The second 
equation states that if we take the number of blue marbles, then double it, we’ll have 
the number of yellow marbles. This is what we were told. 

The next statement of “there are 10 more blue marbles as red” can be written as 
either 


b=r+10 or r=b+10. 


Which is it? 

The first equation says that if we take the number of red marbles, then add 10, 
we’ll have the number of blue marbles. This is what we were told. The next equation 
is wrong; it implies there are more red marbles than blue. 


The final statement tells us that the sum of the red and yellow marbles is the same 
as the sum of the blue and green marbles, giving us the equation 


r+y=b+g. 
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We have four equations; altogether, they are 


b+g+r+y=100 


2b=y 
b=r-+10 
r+y=b+g. 


We want to write these equations in a standard way, with all the unknowns on the 
left and the constants on the right. Let us also write them so that the variables appear 
in the same order in each equation (we’ll use alphabetical order to make it simple). 
We now have 


b+g+r+y= 4100 
2b—y=0 
b—r=10 
—b-—g+r+y=0 


To find the solution, let’s form the appropriate augmented matrix and put it into 
reduced row echelon form. We do so here, without showing the steps. 


al 1 1 1 100 1 00 0 20 

2 0 0 -1 O 0 10 0 30 
rref 

1 0 -1 O 10 001 0 10 

—1 -1 1 1 0 0 00 1 40 


We interpret from the reduced row echelon form of the matrix that we have 20 
blue, 30 green, 10 red and 40 yellow marbles. 


Even if you had a bit of difficulty with the previous example, in reality, this type 
of problem is pretty simple. The unknowns were easy to identify, the equations were 
pretty straightforward to write (maybe a bit tricky for some), and only the necessary 
information was given. 

Most problems that we face in the world do not approach us in this way; most 
problems do not approach us in the form of “Here is an equation. Solve it.” Rather, 
most problems come in the form of: 


Here is a problem. | want the solution. To help, here is lots of information. 
It may be just enough; it may be too much; it may not be enough. You 
figure out what you need; just give me the solution. 


Faced with this type of problem, how do we proceed? Like much of what we’ve 
done in the past, there isn’t just one “right” way. However, there are a few steps 
that can guide us. You don’t have to follow these steps, “step by step,” but if you find 
that you are having difficulty solving a problem, working through these steps may help. 
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(Note: while the principles outlined here will help one solve any type of problem, these 
steps are written specifically for solving problems that involve only linear equations.) 


Key Idea 4 Mathematical Problem Solving 


1. Understand the problem. What exactly is being 
asked? 


. Identify the unknowns. What are you trying to find? 
What units are involved? 


. Give names to your unknowns (these are your vari- 
ables). 


. Use the information given to write as many equations 
as you can that involve these variables. 


. Use the equations to form an augmented matrix; use 
Gaussian elimination to put the matrix into reduced 
row echelon form. 


. Interpret the reduced row echelon form of the matrix 
to identify the solution. 


. Ensure the solution makes sense in the context of the 
problem. 


Having identified some steps, let us put them into practice with some examples. 


Example 17 A concert hall has seating arranged in three sections. As part of a 
special promotion, guests will recieve two of three prizes. Guests seated in the first 
and second sections will receive Prize A, guests seated in the second and third sections 
will receive Prize B, and guests seated in the first and third sections will receive Prize 
C. Concert promoters told the concert hall managers of their plans, and asked how 
many seats were in each section. (The promoters want to store prizes for each section 
separately for easier distribution.) The managers, thinking they were being helpful, 
told the promoters they would need 105 A prizes, 103 B prizes, and 88 C prizes, and 
have since been unavailable for further help. How many seats are in each section? 


SOLUTION Before we rush in and start making equations, we should be clear 
about what is being asked. The final sentence asks: “How many seats are in each 
section?” This tells us what our unknowns should be: we should name our unknowns 
for the number of seats in each section. Let x;, x2 and x3 denote the number of seats 
in the first, second and third sections, respectively. This covers the first two steps of 
our general problem solving technique. 
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(It is tempting, perhaps, to name our variables for the number of prizes given away. 
However, when we think more about this, we realize that we already know this — that 
information is given to us. Rather, we should name our variables for the things we 
don’t know.) 

Having our unknowns identified and variables named, we now proceed to forming 
equations from the information given. Knowing that Prize A goes to guests in the first 
and second sections and that we’ll need 105 of these prizes tells us 


Xi +X = 105. 
Proceeding in a similar fashion, we get two more equations, 
X2 +x3=103 and x,+x3 = 88. 


Thus our linear system is 


Xi +X. = 105 
X2 + X30 = 103 
Xi +x3 = 88 


and the corresponding augmented matrix is 


O 105 
1 103 
1 88 


To solve our system, let’s put this matrix into reduced row echelon form. 


1 1 0 105 1 0 0 45 
0 1 1 103 rref 0 1 0 60 
101 88 001 43 
We can now read off our solution. The first section has 45 seats, the second has 
60 seats, and the third has 43 seats. 
Example 18 Alady takes a 2-mile motorized boat trip down the Highwater River, 


knowing the trip will take 30 minutes. She asks the boat pilot “How fast does this river 
flow?” He replies “I have no idea, lady. | just drive the boat.” 

She thinks for a moment, then asks “How long does the return trip take?” He 
replies “The same; half an hour.” She follows up with the statement, “Since both legs 
take the same time, you must not drive the boat at the same speed.” 

“Naw,” the pilot said. “While | really don’t know exactly how fast | go, | do know 
that since we don’t carry any tourists, | drive the boat twice as fast.” 

The lady walks away satisfied; she knows how fast the river flows. 

(How fast does it flow?) 


SOLUTION This problem forces us to think about what information is given 
and how to use it to find what we want to know. In fact, to find the solution, we'll find 
out extra information that we weren’t asked for! 
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We are asked to find how fast the river is moving (step 1). To find this, we should 
recognize that, in some sense, there are three speeds at work in the boat trips: the 
speed of the river (which we want to find), the speed of the boat, and the speed that 
they actually travel at. 

We know that each leg of the trip takes half an hour; if it takes half an hour to cover 
2 miles, then they must be traveling at 4 mph, each way. 

The other two speeds are unknowns, but they are related to the overall speeds. 
Let’s call the speed of the river r and the speed of the boat b. (And we should be 
careful. From the conversation, we know that the boat travels at two different speeds. 
So we’ll say that b represents the speed of the boat when it travels downstream, so 
2b represents the speed of the boat when it travels upstream.) Let’s let our speed be 
measured in the units of miles/hour (mph) as we used above (steps 2 and 3). 

What is the rate of the people on the boat? When they are travelling downstream, 
their rate is the sum of the water speed and the boat speed. Since their overall speed 
is 4 mph, we have the equation r+ b = 4. 

When the boat returns going against the current, its overall speed is the rate of 
the boat minus the rate of the river (since the river is working against the boat). The 
overall trip is still taken at 4 mph, so we have the equation 2b — r = 4. (Recall: the 
boat is traveling twice as fast as before.) 

The corresponding augmented matrix is 


1 1 4 
2 -1 4)" 
Note that we decided to let the first column hold the coefficients of b. 
Putting this matrix in reduced row echelon form gives us: 
1 1 4 aa 1 0 8/3 
2 -1 4 0 1 4/3|° 
We finish by interpreting this solution: the speed of the boat (going downstream) 


is 8/3 mph, or 2.6 mph, and the speed of the river is 4/3 mph, or 1.3 mph. All we really 
wanted to know was the speed of the river, at about 1.3 mph. 


Example 19 Find the equation of the quadratic function that goes through the 
points (—1,6), (1, 2) and (2, 3). 


SOLUTION This may not seem like a “linear” problem since we are talking 
about a quadratic function, but closer examination will show that it really is. 

We normally write quadratic functions as y = ax? + bx + c where a, b andcare 
the coefficients; in this case, they are our unknowns. We have three points; consider 
the point (—1, 6). This tells us directly that if x = —1, then y = 6. Therefore we know 
that 6 = a(—1)?+b(—1)+c. Writing this in a more standard form, we have the linear 
equation 


c 
The second point tells us that a(1)? + b 


(1) + ¢ = 2, which we can simplify as 
a+b+c=2,and the last point tells us a(2)? 


+ b(2) +c =3,or4a+2b+c=3. 
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Thus our linear system is 


a—b+c = 6 
a+b+c = 2 
4a+2b+c = 3. 


Again, to solve our system, we find the reduced row echelon form of the corre- 
sponding augmented matrix. We don’t show the steps here, just the final result. 


1 -1 1 6 100 1 
1 1 12 rre 010 -2 
4 2 1 3 001 3 
This tells us that a = 1, b = —2 andc = 3, giving us the quadratic function 


y=x*—2x4+3. 


One thing interesting about the previous example is that it confirms for us some- 
thing that we may have known for a while (but didn’t know why it was true). Why do 
we need two points to find the equation of the line? Because in the equation of the 
a line, we have two unknowns, and hence we’ll need two equations to find values for 
these unknowns. 

A quadratic has three unknowns (the coefficients of the x* term and the xterm, and 
the constant). Therefore we’ll need three equations, and therefore we’ll need three 
points. 

What happens if we try to find the quadratic function that goes through 3 points 
that are all on the same line? The fast answer is that you’ll get the equation of a line; 
there isn’t a quadratic function that goes through 3 colinear points. Try it and see! 
(Pick easy points, like (0,0), (1,1) and (2,2). You'll find that the coefficient of the x? 
term is 0.) 

Of course, we can do the same type of thing to find polynomials that go through 4, 
5, etc., points. In general, if you are given n+ 1 points, a polynomial that goes through 
alln + 1 points will have degree at most n. 


Example 20 A woman has 32 $1, $5 and $10 bills in her purse, giving her a total 
of $100. How many bills of each denomination does she have? 


SOLUTION Let’s name our unknowns x, y and z for our ones, fives and tens, 
respectively (it is tempting to call them o, f and t, but o looks too much like 0). We 
know that there are a total of 32 bills, so we have the equation 


Xty+z= 32. 
We also know that we have $100, so we have the equation 
x+5y+ 10z = 100. 


We have three unknowns but only two equations, so we know that we cannot expect 
a unique solution. Let’s try to solve this system anyway and see what we get. 
Putting the system into a matrix and then finding the reduced row echelon form, 


we have 
ie a ae 4 1 0 -2 15 
rref 9 : 
1 5 10 100 O11 5 17 
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Reading from our reduced matrix, we have the infinite solution set 


_ 5 
cE ha 
9 
eae 
Zis free. 


While we do have infinite solutions, most of these solutions really don’t make sense 
in the context of this problem. (Setting z = ; doesn’t make sense, for having half a 
ten dollar bill doesn’t give us $5. Likewise, having z = 8 doesn’t make sense, for then 
we’d have “—1” S5 bills.) So we must make sure that our choice of z doesn’t give us 
fractions of bills or negative amounts of bills. 

To avoid fractions, z must be a multiple of 4 (—4, 0,4, 8,...). Of course, z > O for 
a negative number wouldn’t make sense. If z = O, then we have 15 one dollar bills 
and 17 five dollar bills, giving us $100. If z = 4, then we have x = 20 andy = 8. 
We already mentioned that z = 8 doesn’t make sense, nor does any value of z where 
z> 8. 

So it seems that we have two answers; one with z = O and one with z = 4. Of 
course, by the statement of the problem, we are led to believe that the lady has at 
least one $10 bill, so probably the “best” answer is that we have 20 S11 bills, 8 $5 bills 
and 4 $10 bills. The real point of this example, though, is to address how infinite solu- 
tions may appear in a real world situation, and how suprising things may result. 


Example 21 In a football game, teams can score points through touchdowns 
worth 6 points, extra points (that follow touchdowns) worth 1 point, two point con- 
versions (that also follow touchdowns) worth 2 points and field goals, worth 3 points. 
You are told that in a football game, the two competing teams scored on 7 occasions, 
giving a total score of 24 points. Each touchdown was followed by either a successful 
extra point or two point conversion. In what ways were these points scored? 


SOLUTION The question asks how the points were scored; we can interpret 
this as asking how many touchdowns, extra points, two point conversions and field 
goals were scored. We’ll need to assign variable names to our unknowns; let t rep- 
resent the number of touchdowns scored; let x represent the number of extra points 
scored, let w represent the number of two point conversions, and let f represent the 
number of field goals scored. 

Now we address the issue of writing equations with these variables using the given 
information. Since we have a total of 7 scoring occasions, we know that 


t+xt+w+f=7. 


The total points scored is 24; considering the value of each type of scoring opportunity, 
we can write the equation 


6t +x +2w+4 3f=24. 
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Finally, we know that each touchdown was followed by a successful extra point or two 
point conversion. This is subtle, but it tells us that the number of touchdowns is equal 
to the sum of extra points and two point conversions. In other words, 


t=x-+w. 


To solve our problem, we put these equations into a matrix and put the matrix into 
reduced row echelon form. Doing so, we find 


1 1 1 1 7 1 0 0 
— 
6 1 2 3 24 rref 010 1 4 
1 -1 -1 0 0 001 
Therefore, we know that 


t=3.5—0.5f 
x=4-f 
w= —0.54+ 0.5f. 


We recognize that this means there are “infinite solutions,” but of course most of these 
will not make sense in the context of a real football game. We must apply some logic 
to make sense of the situation. 

Progressing in no particular order, consider the second equation, x = 4 — f. In 
order for us to have a positive number of extra points, we must have f < 4. (And of 
course, we need f > 0, too.) Therefore, right away we know we have a total of only 5 
possibilities, where f = 0, 1, 2, 3 or 4. 

From the first and third equations, we see that if fis an even number, then t and w 
will both be fractions (for instance, if f = 0, then t = 3.5) which does not make sense. 
Therefore, we are down to two possible solutions, f = 1 and f = 3. 

If f = 1, we have 3 touchdowns, 3 extra points, no two point conversions, and 
(of course), 1 field goal. (Check to make sure that gives 24 points!) If f = 3, then 
we 2 touchdowns, 1 extra point, 1 two point conversion, and (of course) 3 field goals. 
Again, check to make sure this gives us 24 points. Also, we should check each solution 
to make sure that we have a total of 7 scoring occasions and that each touchdown 
could be followed by an extra point or a two point conversion. 


We have seen a variety of applications of systems of linear equations. We would 
do well to remind ourselves of the ways in which solutions to linear systems come: 
there can be exactly one solution, infinite solutions, or no solutions. While we did see 
a few examples where it seemed like we had only 2 solutions, this was because we 
were restricting our solutions to “make sense” within a certain context. 

We should also remind ourselves that linear equations are immensely important. 
The examples we considered here ask fundamentally simple questions like “How fast is 
the water moving?” or “What is the quadratic function that goes through these three 
points?” or “How were points in a football game scored?” The real “important” situ- 
ations ask much more difficult questions that often require thousands of equations! 


1.5 Applications of Linear Systems 


(Gauss began the systematic study of solving systems of linear equations while trying 
to predict the next sighting of a comet; he needed to solve a system of linear equations 
that had 17 unknowns. Today, this a relatively easy situation to handle with the help of 
computers, but to do it by hand is a real pain.) Once we understand the fundamentals 
of solving systems of equations, we can move on to looking at solving bigger systems 
of equations; this text focuses on getting us to understand the fundamentals. 


Exercises 1.5 


In Exercises 1 — 5, find the solution of the 
given problem by: 


(a) creating an appropriate system of linear 
equations 


(b) forming the augmented matrix that cor- 
responds to this system 


(c) putting the augmented matrix into re- 
duced row echelon form 


(d) interpreting the reduced row echelon 
form of the matrix as a solution 


1. A farmer looks out his window at his 
chickens and pigs. He tells his daugh- 
ter that he sees 62 heads and 190 legs. 
How many chickens and pigs does the 
farmer have? 


2. A lady buys 20 trinkets at a yard sale. 
The cost of each trinket is either $0.30 
or $0.65. If she spends $8.80, how 
many of each type of trinket does she 
buy? 


3. Acarpenter can make two sizes of table, 
grande and venti. The grande table re- 
quires 4 table legs and 1 table top; the 
venti requires 6 table legs and 2 table 
tops. After doing work, he counts up 
spare parts in his warehouse and real- 
izes that he has 86 table tops left over, 
and 300 legs. How many tables of each 
kind can he build and use up exactly all 
of his materials? 


4. A jar contains 100 marbles. We know 
there are twice as many green marbles 
as red; that the number of blue and yel- 
low marbles together is the same as the 
number of green; and that three times 
the number of yellow marbles together 
with the red marbles gives the same 


numbers as the blue marbles. How 
many of each color of marble are in the 
jar? 


. Arescue mission has 85 sandwiches, 65 


bags of chips and 210 cookies. They 
know from experience that men will 
eat 2 sandwiches, 1 bag of chips and 
4 cookies; women will eat 1 sandwich, 
a bag of chips and 2 cookies; kids will 
eat half a sandwhich, a bag of chips and 
3 cookies. If they want to use all their 
food up, how many men, women and 
kids can they feed? 


In Exercises 6 — 15, find the polynomial with 
the smallest degree that goes through the 


given points. 
6. (1,3) and (3, 15) 
7. (2,14) and (3, 4) 
8. (1,5), (—1,3) and (3, —1) 
9. (—4, —3), (0,1) and (1, 4.5) 
10. (—1, —8), (1, —2) and (3, 4) 
11. (—3,3), (1,3) and (2,3) 
12. (—2,15), (—1,4), (1,0) and (2, —5) 
13. (—2,—7), (1,2), (2,9) and (3, 28) 
14. (—3,10), (—1, 2), (1,2) and (2,5) 
15. (0,1), (—3, —3.5), (—2, —2) and (4, 7) 
16. The general exponential function has 


the form f(x) = ae™, where a and bare 
constants and e is Euler’s constant (~ 
2.718). We want to find the equation 
of the exponential function that goes 
through the points (1, 2) and (2, 4). 
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17. 


18. 


(a) Show why we cannot simply sub- 
situte in values for x and y in 
y = ae™ and solve using the 
techniques we used for polyno- 
mials. 


(b) Show how the equality y = ae™ 
leads us to the linear equation 
Iny = Ina+ bx. 

(c) Use the techniques we devel- 
oped to solve for the unknowns 
Ina and b. 

(d) Knowing Ina, find a; find the ex- 
ponential function f(x) = ae™ 
that goes through the points 
(1,2) and (2, 4). 


In a football game, 24 points are scored 
from 8 scoring occasions. The number 
of successful extra point kicks is equal 
to the number of successful two point 
conversions. Find all ways in which the 
points may have been scored in this 
game. 


In a football game, 29 points are scored 
from 8 scoring occasions. There are 2 
more successful extra point kicks than 
successful two point conversions. Find 
all ways in which the points may have 
been scored in this game. 


19. 


20. 


21. 


22. 


23. 


24. 


In a basketball game, where points are 
scored either by a 3 point shot, a2 point 
shot or a 1 point free throw, 80 points 
were scored from 30 successful shots. 
Find all ways in which the points may 
have been scored in this game. 


In a basketball game, where points are 
scored either by a 3 point shot, a2 point 
shot or a 1 point free throw, 110 points 
were scored from 70 successful shots. 
Find all ways in which the points may 
have been scored in this game. 


Describe the equations of the linear 
functions that go through the point 
(1,3). Give 2 examples. 


Describe the equations of the linear 
functions that go through the point 
(2,5). Give 2 examples. 


Describe the equations of the quadratic 
functions that go through the points 
(2, —1) and (1,0). Give 2 examples. 


Describe the equations of the quadratic 
functions that go through the points 
(—1, 3) and (2,6). Give 2 examples. 


MATRIX ARITHMETIC 


A fundamental topic of mathematics is arithmetic; adding, subtracting, multiplying and 
dividing numbers. After learning how to do this, most of us went on to learn how to 
add, subtract, multiply and divide “x”. We are comfortable with expressions such as 


x+3x—x-x*4+x?-x7I 
and know that we can “simplify” this to 


4x — x? +x". 


This chapter deals with the idea of doing similar operations, but instead of an un- 
known number x, we will be using a matrix A. So what exactly does the expression 


A+3A—-A-A2+A°-At 


mean? We are going to need to learn to define what matrix addition, scalar multiplica- 
tion, matrix multiplication and matrix inversion are. We will learn just that, plus some 
more good stuff, in this chapter. 


2.1 Matrix Addition and Scalar Multiplication 


AS YOU READ 


1. When are two matrices equal? 


2. Write an explanation of how to add matrices as though writing to someone who 
knows what a matrix is but not much more. 


3. T/F: There is only 1 zero matrix. 


4. T/F: To multiply a matrix by 2 means to multiply each entry in the matrix by 2. 
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In the past, when we dealt with expressions that used “x,” we didn’t just add and 
multiply x’s together for the fun of it, but rather because we were usually given some 
sort of equation that had x in it and we had to “solve for x.” 

This begs the question, “What does it mean to be equal?” Two numbers are equal, 
when, ..., uh, ..., nevermind. What does it mean for two matrices to be equal? We 
say that matrices A and B are equal when their corresponding entries are equal. This 
seems like a very simple definition, but it is rather important, so we give it a box. 


Definition 8 Matrix Equality 


Two m x n matrices A and B are equal if their corresponding 
entries are equal. 


Notice that our more formal definition specifies that if matrices are equal, they 
have the same dimensions. This should make sense. 

Now we move on to describing how to add two matrices together. To start off, take 
a wild stab: what do you think the following sum is equal to? 


Baek a= 
sx), 


you guessed correctly. That wasn’t so hard, was it? 


Let’s keep going, hoping that we are starting to get on a roll. Make another wild 
guess: what do you think the following expression is equal to? 


1 2 
. =? 
dla |S 


3 6 
fe af 
you guessed correctly! 

Even if you guessed wrong both times, you probably have seen enough in these 
two examples to have a fair idea now what matrix addition and scalar multiplication 
are all about. 

Before we formally define how to perform the above operations, let us first recall 
that if Ais an m x n matrix, then we can write A as 


If you guessed 


If you guessed 


M11 12—— ++ At 

G21 G22, +** ~~ Aan 
A = 

Ami Am2 a A Qmn 


2.1 Matrix Addition and Scalar Multiplication 


Secondly, we should define what we mean by the word scalar. A scalar is any number 
that we multiply a matrix by. (In some sense, we use that number to scale the matrix.) 
We are now ready to define our first arithmetic operations. 


Definition 9 Matrix Addition 


Let A and B be m x n matrices. The sum of A and B, denoted 
A-+ B, is 


Qn+b6y  Gy+by +++ A+ bin 
Q21 +621 G22 +622 +++ Aan + ban 


ami + bm Om2 + bm2 ae Amn + bmn 


Definition 10 Scalar Multiplication 


Let A be an m x rn matrix and let k be a scalar. The scalar 
multiplication of k and A, denoted KA, is 


kay, kay oS kain, 
kan, ka22 oo8 ka2n 


kam1 kam caren kamn 


We are now ready for an example. 


Example 22 Let 
Alec ah Gale oo, eal 
5 De So —-1 0 4 
Simplify the following matrix expressions. 
1.A+8B 3. A-—B 5. —3A+ 2B 7. 5A+5B 
2. B+A 4. A+C 6. A—A 8. 5(A+B) 


SOLUTION 
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oO WwW 
oD) 
Ww wo 


1.A+B= 


pS 
uw 
wo 


oO WwW 
oO 
Ww wo 


2, B+A= 


B 
uw 
wo 


—-1 -2 -3 
3,.A-—B=j];-2 0 -1 
6 5 1 


4. A + Cis not defined. If we look at our definition of matrix addition, we see 
that the two matrices need to be the same size. Since A and C have different 
dimensions, we don’t even try to create something as an addition; we simply 
say that the sum is not defined. 


1 2 3 
5. —3A+ 2B = 5 —2 1 
—17 —-15 —-7 


5 10 15 10 20 30 15 30 45 
7. Strictlyspeaking, thisis |} —5 10 5 /+] 5 10 10} =] 0O 20 15 
25 25 25 —-5 O 20 20 25 45 
8. Strictly speaking, this is 

1 2 3 2 4 6 3 6 9 

5 —1 2 1;+4]1 #2 2 =5-|0 4 3 

5 5 5 —-1 0 4 4 5 9 

15 30 45 

=]|0O 20 15 

20 25 45 


Our example raised a few interesting points. Notice how A+B = B+ A. We 
probably aren’t suprised by this, since we know that when dealing with numbers, a + 
b = b +a. Also, notice that 5A + 5B = 5(A + B). In our example, we were careful to 
compute each of these expressions following the proper order of operations; knowing 
these are equal allows us to compute similar expressions in the most convenient way. 

Another interesting thing that came from our previous example is that 


2.1 Matrix Addition and Scalar Multiplication 


It seems like this should be a special matrix; after all, every entry is O and O is a special 
number. 

In fact, this is a special matrix. We define 0, which we read as “the zero matrix,” 
to be the matrix of all zeros.1 We should be careful; this previous “definition” is a bit 


ambiguous, for we have not stated what size the zero matrix should be. Is i | 


the zero matrix? How about [0 0]? 

Let’s not get bogged down in semantics. If we ever see 0 in an expression, we 
will usually know right away what size 0 should be; it will be the size that allows the 
expression to make sense. If A is a 3 x 5 matrix, and we write A + 0, we'll simply 
assume that 0 is also a 3 x 5 matrix. If we are ever in doubt, we can add a subscript; 
for instance, 02,7 is the 2 x 7 matrix of all zeros. 

Since the zero matrix is an important concept, we give it it’s own definition box. 


Definition 11 The Zero Matrix 


The m x n matrix of all zeros, denoted 0,,.n, is the zero 


matrix. 


When the dimensions of the zero matrix are clear from the 
context, the subscript is generally omitted. 


The following presents some of the properties of matrix addition and scalar multi- 
plication that we discovered above, plus a few more. 


Theorem 2 Properties of Matrix Addition and Scalar Multiplication 


The following equalities hold for all m x n matrices A, B and 
Cand scalars k. 


. A+B =B+A (Commutative Property) 


. (A+B) +C=A-+ (B+C) (Associative Property) 


. k(A+B) = kA+kB (Scalar Multiplication Distributive 
Property) 


. kA =Ak 
.A+0=0+A =A (Additive Identity) 


. OA=0 


Be sure that this last property makes sense; it says that if we multiply any matrix 
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by the number O, the result is the zero matrix, or 0. 
We began this section with the concept of matrix equality. Let’s put our matrix 
addition properties to use and solve a matrix equation. 


Example 23 Let 


Find the matrix X such that 
2A + 3X = —4A. 


SOLUTION We can use basic algebra techniques to manipulate this equation 
for X; first, let’s subtract 2A from both sides. This gives us 
3X = —6A. 
Now divide both sides by 3 to get 
X = —2A. 


Now we just need to compute —2A; we find that 
-4 2 
x= [73 b]- 


Our matrix properties identified 0 as the Additive Identity; i.e., if you add 0 to any 
matrix A, you simply get A. This is similar in notion to the fact that for all numbers a, 
a+0 =a. A Multiplicative Identity would be a matrix / where / x A = A for all matri- 
ces A. (What would such a matrix look like? A matrix of all 1s, perhaps?) However, in 
order for this to make sense, we’ll need to learn to multiply matrices together, which 
we'll do in the next section. 


Exercises 2.1 


Matrices A and B are given below. In Exer- 5. 3(A—B)+8B 


cises 1 — 6, simplify the given expression. 
eee 6, 2(A—B) — (A = 38) 


A= as B= = Matrices A and B are given below. In Exer- 
7 4 5 9 : net : A 
cises 7 — 10, simplify the given expression. 
3 —2 
1.A+B a=|2| a= (77 
2. 2A — 3B 
3. 3A-A 7. 4B—2A 
4. 4B—2A 8. —2A+ 3A 


9. —2A—3A 
10. —B + 3B — 2B 


Matrices A and B are given below. In Exer- 
cises 11 — 14, find X that satisfies the equa- 
tion. 


11. 2A+X=8B 


12. A—X=3B 


13. 3A+ 2X = —1B 
14. A—5X=-B 


In Exercises 15 — 21, find values for the scalars 
a and b that satisfy the given equation. 


2.2 Matrix Multiplication 


AS YOU READ 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


2.2 Matrix Multiplication 


A eo[2I-E 
BJ +f] [a 
Je L 
o|{3]-[5 


1. T/F: Column vectors are used more in this text than row vectors, although some 


other texts do the opposite. 


2. T/F: To multiply A x B, the number of rows of A and B need to be the same. 


3. T/F: The entry in the 2" row and 3" column of the product AB comes from 
multipling the 2"% row of A with the 3 column of B. 


4. Name two properties of matrix multiplication that also hold for “regular multi- 


plication” of numbers. 


5. Name a property of “regular multiplication” of numbers that does not hold for 


matrix multiplication. 


6. T/F:A27 =A-A-A 


In the previous section we found that the definition of matrix addition was very 
intuitive, and we ended that section discussing the fact that eventually we’d like to 


know what it means to multiply matrices together. 


In the spirit of the last section, take another wild stab: what do you think 


B a) *| 


1 
2 


| 
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means? 
You are likely to have guessed 


but this is, in fact, not right.2 The actual answer is 


5 3 

11 5)" 
If you can look at this one example and suddenly understand exactly how matrix mul- 
tiplication works, then you are probably smarter than the author. While matrix multi- 


plication isn’t hard, it isn’t nearly as intuitive as matrix addition is. 
To further muddy the waters (before we clear them), consider 


palx[2 2S): 


Our experience from the last section would lend us to believe that this is not defined, 
but our confidence is probably a bit shaken by now. In fact, this multiplication is de- 


fined, and it is 
5 3 -2 
11 5 -4]" 


You may see some similarity in this answer to what we got before, but again, probably 
not enough to really figure things out. 

So let’s take a step back and progress slowly. The first thing we’d like to do is define 
a special type of matrix called a vector. 


Definition 12 Column and Row Vectors 


Am xX 1 matrix is called a column vector. 


A1 X n matrix is called a row vector. 


While it isn’t obvious right now, column vectors are going to become far more use- 
ful to us than row vectors. Therefore, we often omit the word “column” when refering 
to column vectors, and we just call them “vectors.”? 


21 guess you could define multiplication this way. If you’d prefer this type of multiplication, write your 
own book. 

31n this text, row vectors are only used in this section when we discuss matrix multiplication, whereas 
we’ll make extensive use of column vectors. Other texts make great use of row vectors, but little use of 
column vectors. It is a matter of preference and tradition: “most” texts use column vectors more. 


2.2 Matrix Multiplication 


We have been using upper case letters to denote matrices; we use lower case let- 
ters with an arrow overtop to denote row and column vectors. An example of a row 


vector is 
g=[1 2 -1 0] 


and an example of a column vector is 


Before we learn how to multiply matrices in general, we will learn what it means 
to multiply a row vector by a column vector. 


Definition 13 Multiplying a row vector by a column vector 


Let Ube an1 x nrow vector with entries uj, U2,-:+,U, and 
let Vbe ann x 1 column vector with entries v1, V2, --:, Vn. 
The product of U and V, denoted i - V or WV, is 


n 
) UjVj = U4Vy + UQV2 + +++ + UpnVy. 
i= 


Don’t worry if this definition doesn’t make immediate sense. It is really an easy 
concept; an example will make things more clear. 


Example 24 Let 
- ~ 3 _ 2 
gG=[1 2 3],v=[2 0 1 -1],%=/ 4], y=]¢ 
2 0 
Find the following products. 
1. ux 3. Uy 
5. Xu 
2. vy 4. uv 
SOLUTION 
—2 
1.ax=[1 2 3] | 4 | =1(-2)+2(4)+3(3) =15 
3 
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2.Ww=[2 0 1 -1] = 2(1) + 0(2) + 1(5) — 1(0) =7 


OWN BR 


3. Uy is not defined; Definition 13 specifies that in order to multiply a row vector 
and column vector, they must have the same number of entries. 


4. UV is not defined; we only know how to multipy row vectors by column vec- 
tors. We haven’t defined how to multiply two row vectors (in general, it can’t 
be done). 


5. The product xu is defined, but we don’t know how to do it yet. Right now, we 
only know how to multiply a row vector times a column vector; we don’t know 
how to multiply a column vector times a row vector. (That’s right: Ux 4 xu!) 

Now that we understand how to multiply a row vector by a column vector, we are 

ready to define matrix multiplication. 


Definition 14 Matrix Multiplication 


Let A be anm x r matrix, and let B be an rr x n matrix. The 


matrix product of A and B, denoted A- B, or simply AB, is the 
m Xx nmatrix M whose entry in the i” row and j*" column is 
the product of the i” row of A and the j*" column of B. 


It may help to illustrate it in this way. Let matrix A have rows dj, G5, --+, Gm and let 
B have columns bj, b2, ---, by. Thus A looks like 


1 
a 7 


a 


where the “—” symbols just serve as reminders that the a; represent rows, and B looks 
like 


| | | 
Be be tee Fe 
| | | 


where again, the “|” symbols just remind us that the b; represent column vectors. Then 


“yr 


a,b, aybp +--+ aybn 

ab; ab, +--+ abn 
AB = 

Gmby Gmb2 ++: Ambn 


2.2. Matrix Multiplication 


Two quick notes about this definition. First, notice that in order to multiply A and B, 
the number of columns of A must be the same as the number of rows of B (we refer to 
these as the “inner dimensions”). Secondly, the resulting matrix has the same number 
of rows as A and the same number of co/umns as B (we refer to these as the “outer 
dimensions”). 


final dimensions are the outer 
dimensions 
a 
(m x r) x (r x n) 
“s’ 


these inner dimensions 


must match 


Of course, this will make much more sense when we see an example. 


Example 25 Revisit the matrix product we saw at the beginning of this section; 
multiply 
1 2 1 -1 O 
3 4}/}2 2 -1)}° 
SOLUTION Let’s call our first matrix A and the second B. We should first check 


to see that we can actually perform this multiplication. Matrix A is 2 x 2 and B is 
2 x 3. The “inner” dimensions match up, so we can compute the product; the “outer” 
dimensions tell us that the product will be 2 x 3. Let 
m m m 
AB = 11 12 13 | 
M21 M22 -M23 


Let’s find the value of each of the entries. 
The entry mj, is in the first row and first column; therefore to find its value, we 
need to multiply the first row of A by the first column of B. Thus 


1 


m,=[1 2] H = 1(1)4+2(2) =5. 


So now we know that 


48 | 5 My wae 


Ma. M22) M23 


Finishing out the first row, we have 


—1 


m2=[1 2] | 5 = 1(-1) + 2(2) =3 


using the first row of A and the second column of B, and 
0 
m3=([1 2] | = 1(0) + 2(-1) = -2 
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using the first row of A and the third column of B. Thus we have 


5 3 —2 
M21 M22 M23 | - 


AB = | 
To compute the second row of AB, we multiply with the second row of A. We find 


mi=([3 4] H = 11, 


and 
y | si 


Thus 
1 2}//1 -1 O 5 3 -2 
a= |; allls 2 oleae 5 =k 
Example 26 Multiply 
: o E 4.4 | 
“4 3 267 9 
SOLUTION Let’s first check to make sure this product is defined. Again calling 


the first matrix A and the second B, we see that A is a3 X 2 matrix and Bisa2 x 4 
matrix; the inner dimensions match so the product is defined, and the product will be 


a3 x 4 matrix, 
Mit M120 -M13—— M14 


AB=}m21 M22 M23, M24 
M31 M32 M33 ~:M34 


We will demonstrate how to compute some of the entries, then give the final an- 
swer. The reader can fill in the details of how each entry was computed. 


mi=[1 —1] H =-1. 
m3=[1 —-1] B =-6. 


m3=([5 2] H = 16, 


Xe) 


ie= 16 2] | 


2.2. Matrix Multiplication 


mz =[-2 3] E = 16. 


my =[-2 3] B = 25. 


So far, we’ve computed this much of AB: 


© 


—-1 mz —-6 my 
AB={|m1 M22 19 23 
M31 16 M33 25 


The final product is 


4 16 19 25 


Example 27 Multiply, if possible, 
2 3 4//3 6 
9 8 7 5 -1]° 
SOLUTION Again, we’ll call the first matrix A and the second B. Checking the 


dimensions of each matrix, we see that A is a 2 x 3 matrix, whereas Bis a 2 x 2 matrix. 
The inner dimensions do not match, therefore this multiplication is not defined. 


Example 28 In Example 24, we were told that the product xu was defined, 
where 
—2 
X= | 4 and G=[1 2 3], 
3 


although we were not shown what that product was. Find xu. 


SOLUTION Again, we need to check to make sure the dimensions work cor- 
rectly (remember that even though we are referring to U and x as vectors, they are, in 
fact, just matrices). 

The column vector x has dimensions 3 x 1, whereas the row vector u has dimen- 
sions 1 x 3. Since the inner dimensions do match, the matrix product is defined; the 
outer dimensions tell us that the product will be a 3 x 3 matrix, as shown below: 


m1 M12) «M13 
xUuU= {M1 M2) M23 
M31 M32 =~M33 


To compute the entry m,;, we multiply the first row of x by the first column of U. 
What is the first row of xX? Simply the number —2. What is the first column of @? Just 
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the number 1. Thus my, = —2. (This does seem odd, but through checking, you can 
see that we are indeed following the rules.) 

What about the entry m2? Again, we multiply the first row of x by the first column 
of u; that is, we multiply —2(2). So my. = —4. 

What about m3? Multiply the second row of x by the third column of d; multiply 
4(3), So m23 = 12. 

One final example: m3, comes from multiplying the third row of x, which is 3, by 
the first column of U, which is 1. Therefore m3; = 3. 

So far we have computed 


—2 —4 M43 
xu = M21 M22 12 
3 M32_—-M33 


After performing all 9 multiplications, we find 


2 -4 -6 
i=|4 8 12 
3 6 9 


In this last example, we saw a “nonstandard” multiplication (at least, it felt non- 
standard). Studying the entries of this matrix, it seems that there are several different 
patterns that can be seen amongst the entries. (Remember that mathematicians like 
to look for patterns. Also remember that we often guess wrong at first; don’t be scared 
and try to identify some patterns.) 


In Section 2.1, we identified the zero matrix 0 that had a nice property in relation to 
matrix addition (i.e., A+0 = A for any matrix A). In the following example we'll identify 
a matrix that works well with multiplication as well as some multiplicative properties. 
For instance, we’ve learned how 1- A = A; is there a matrix that acts like the number 
1? That is, can we find a matrix X where X- A = A?* 


Example 29 Let 
1 2 3 111 
A= 2 -7 5), B=);1 1 1 
—2 -8 3 111 
102 1 0 0 
c=|]2 1 0], /=]0 1 0 
021 001 
Find the following products. 
1. AB 3. A03x4 5. IA 7. BC 
2. BA 4. Al 6. 8. B? 


4We made a guess in Section 2.1 that maybe a matrix of all 1s would work. 


2.2 Matrix Multiplication 


SOLUTION We will find each product, but we leave the details of each computa- 
tion to the reader. 


1 2 3 111 6 6 6 
1. AB= 2 -7 5 1 11)]= 0 0 0 
—2 -8 3 111 —7 -7 -7 
111 1 2 3 1 -13 11 
2.BA=;1 11 2 -7 5S|=]1 -13 11 
111 —2 -8 3 1 -13 11 
3 A034 = 034 
1 2 3 1 0 0 1 2 3 
4. Al = 2 -7 5 01 0;= 2 -—7 5 
—2 -8 3 001 —2 -8 3 
1 0 0 1 2 3 1 2 3 
5. A=j|;0 1 0 2 -—7 S|= 2 -7 5 
001 —2 -8 3 —2 -8 3 


6. We haven’t formally defined what /* means, but we could probably make the 
reasonable guess that /* = /-/. Thus 


10o0/]/100 10 0 

P=!010]}/0 1 0}/=/]0 10 

001]//0 01 001 
111 102 3 3 3 
7.BC=]1 11 210/=/3 3 3 
1141//021 3 3 3 
111 111 3 3 3 
8.B?=BB=/]11 1 111/=/3 3 3 
111 111 3 3 3 


This example is simply chock full of interesting ideas; it is almost hard to think about 
where to start. 

Interesting Idea #1: Notice that in our example, AB ~ BA! When dealing with 
numbers, we were used to the idea that ab = ba. With matrices, multiplication is 
not commutative. (Of course, we can find special situations where it does work. In 
general, though, it doesn’t.) 

Interesting Idea #2: Right before this example we wondered if there was a matrix 
that “acted like the number 1,” and guessed it may be a matrix of all 1s. However, 
we found out that such a matrix does not work in that way; in our example, AB 4 A. 
We did find that AJ = JA = A. There is a Multiplicative Identity; it just isn’t what we 
thought it would be. And just as 12 = 1, ? =1. 

Interesting Idea #3: When dealing with numbers, we are very familiar with the 
notion that “If ax = bx, thena = b.” (As long as x 4 0.) Notice that, in our example, 
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BB = BC, yet B # C. In general, just because AX = BX, we cannot conclude thatA = B. 


Matrix multiplication is turning out to be a very strange operation. We are very 
used to multiplying numbers, and we know a bunch of properties that hold when using 
this type of multiplication. When multiplying matrices, though, we probably find our- 
selves asking two questions, “What does work?” and “What doesn’t work?” We’ll an- 


swer these questions; first we’ll do an example that demonstrates some of the things 
that do work. 


Example 30 Let 


Find the following: 


1. A(B+C) 3. A(BC) 
2. AB +AC 4. (AB)C 
SOLUTION We'll compute each of these without showing all the intermediate 


steps. Keep in mind order of operations: things that appear inside of parentheses 
are computed first. 


| (+c) =| 3 “RCE 4 F 1) 
“B alb @ 
Lt | 


2.2. Matrix Multiplication 


In looking at our example, we should notice two things. First, it looks like the “dis- 
tributive property” holds; that is, A(B + C) = AB + AC. This is nice as many algebraic 
techniques we have learned about in the past (when doing “ordinary algebra”) will still 
work. Secondly, it looks like the “associative property” holds; that is, A(BC) = (AB)C. 
This is nice, for it tells us that when we are multiplying several matrices together, we 
don’t have to be particularly careful in what order we multiply certain pairs of matrices 


together.° 


In leading to an important theorem, let’s define a matrix we saw in an earlier ex- 


ample.® 


Definition 15 


Identity Matrix 


The n x n matrix with 1’s on the diagonal and zeros else- 


where is the n x n identity matrix, denoted |,. When the 
context makes the dimension of the identity clear, the sub- 
script is generally omitted. 


Note that while the zero matrix can come in all different shapes and sizes, the 


5Be careful: in computing ABC together, we can first multiply AB or BC, but we cannot change the order 
in which these matrices appear. We cannot multiply BA or AC, for instance. 

The following definition uses a term we won't define until Definition 20 on page 123: diagonal. In short, 
a “diagonal matrix” is one in which the only nonzero entries are the “diagonal entries.” The examples given 
here and in the exercises should suffice until we meet the full definition later. 
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identity matrix is always a square matrix. We show a few identity matrices below. 


1 0 0 0 

1 0 a 0100 
b= Oo #2 lZ=}0 1 0}, h= 0010 
es 000 1 


In our examples above, we have seen examples of things that do and do not work. 
We should be careful about what examples prove, though. If someone were to claim 
that AB = BA is always true, one would only need to show them one example where 
they were false, and we would know the person was wrong. However, if someone 
claims that A(B + C) = AB + AC is always true, we can’t prove this with just one 
example. We need something more powerful; we need a true proof. 

In this text, we forgo most proofs. The reader should know, though, that when 
we state something in a theorem, there is a proof that backs up what we state. Our 
justification comes from something stronger than just examples. 

Now we give the good news of what does work when dealing with matrix multipli- 
cation. 


Theorem 3 Properties of Matrix Multiplication 


Let A, B and C be matrices with dimensions so that the fol- 
lowing operations make sense, and let k be a scalar. The 
following equalities hold: 


1. A(BC) = (AB)C (Associative Property) 


2. A(B+C) = AB+ABand 
(B+ C)A = BA + CA (Distributive Property) 


_ k(AB) = (kA)B = A(kB) 


MASA 


The above box contains some very good news, and probably some very surprising 
news. Matrix multiplication probably seems to us like a very odd operation, so we 
probably wouldn’t have been surprised if we were told that A(BC) 4 (AB)C. Itisa 
very nice thing that the Associative Property does hold. 

As we near the end of this section, we raise one more issue of notation. We define 
A° = |. If nis a positive integer, we define 


A°=A-A---- -A. 
«J 
ntimes 


With numbers, we are used to a~" = =. Do negative exponents work with matri- 
ces, too? The answer is yes, sort of. We’ll have to be careful, and we’ll cover the topic 


2.2 Matrix Multiplication 


in detail once we define the inverse of a matrix. For now, though, we recognize the 


fact that A~? x i for 7 makes no sense; we don’t know how to “divide” by a matrix. 


We end this section with a reminder of some of the things that do not work with 
matrix multiplication. The good news is that there are really only two things on this 
list. 


1. Matrix multiplication is not commutative; that is, AB ~ BA. 
2. In general, just because AX = BX, we cannot conclude that A = B. 


The bad news is that these ideas pop up in many places where we don’t expect them. 
For instance, we are used to 


(a+b)? =a? + 2ab + b’. 
What about (A + B)? All we'll say here is that 
(A+B)? #A? + 2AB + B’; 


we leave it to the reader to figure out why. 
The next section is devoted to visualizing column vectors and “seeing” how some 
of these arithmetic properties work together. 


Exercises 2.2 


In Exercises 1 — 12, row and column vectors U 1 
and Vare defined. Find the product uv, where 8 G=[-3 6 1] ¥=]-1 
possible. 1 
1@=[1 -4] i= |? S(t 2-344) 
1 
2.4=[2 3] v=|7, v= 7 
-1 
+ Es 3 
a= 1s: <4] r= |3| 10.¢=[6 2 -1 2] 
z Pee ek: 3 
4.u0=[06 08] 7=|b8| ss [2 
9 
2 5 
5.uG=/1 2 -1 v- [3 3 
7 17s [t 23] 7=|3| 
—1 
6. G={3 2 -2] v=] 0 1 
: s 12.¢=[2 -5] v=]}1 
1 
2 
7.c¢0=!/8 -4 3) v=]/4 In Exercises 13 — 27, matrices A and B are de- 
: : E fined. 
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(b 


— 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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Give the dimensions of A and B. If the 2.A- 1 ‘] 
dimensions properly match, give the [7 6 
dimensions of AB and BA. os 1 -1 -5 5 
Find the products AB and BA, if possi- —2 1 a 
ble. r 
—1 5 
1 2 2 8 Spo de | 
A= B= : 
-1 4 3-1 5 -3 -4 eH 
B= 
3 yl E >| —2 -5 -5 -1 
A= B= 
2 5 3 -3 -12 1 
Ss 3-41 24. A= - ; a 
= 5, -<g = 
ga) es le 
~ 14 2 9 = a 
1 0 O 
: 1 1 1 
EH See cay | ee eae ee 
—2 -4 ne |i % = 
as 1 1 —2 
—2 0 
B=|, 2 -2 -2 -2 
B= 0 —2 O 
9 4 3 —2 O 2 
; E —5 | 
-4 3 3 
pa|~2 9° 26. A=|-5 -1 —-5 
[-2 -1 S65. 205. : 4 
—2 -1 0 0 
A=|9 -5 B=|-5 —4 3 
3 -1 5 -4 3 
B= Ei e =] af eS: «9 
0, 28 aS 27.A=|2 -3 5 
2 6 1 5 3 
A=|6 2 —2 4 3 
5 -1 B= | =1,.-1t. =1 
a E 5 O 4 0 2 
-4 4 —4 In Exercises 28 — 33, a diagonal matrix Danda 
matrix A are given. Find the products DA and 
=5. -<2 . 
AD, where possible. 
A=|-5 -2 
—5 -4 [ 
28. D= 2 oe 
pale =e & [9-2 
~|-5 =—3 —1 Wien tee 
~ 16 8 
8 -2 7 
A=]4 5 [4 0 
2 -5 Gerre = [0 | 
—-5 1 -5 J1 2 
a: 3 | eal ‘| 


—-1 0 0 
30.D=] 0 2 0 
0 O 3 
1 2 3 
A=/]4 5 6 
7 8 9 
1 1 1 
31. D= 2 2 2 
—-3 -3 -3 
2 OO O 
A=|]0 -3 0 
0 oO 5 
dq, O 
2. o=(¢ Z| 
a b 
ape 
dq O 0 
33. D=|]0 dad O 
0 O a3 
abe 
A=|d e f 
g hi 


In Exercises 34 — 39, a matrix A and a vector X 


are given. Find the product Ax. 


2 3 = 4 
zan[? 3), 2[f 


—-1 4 - 2 
Sei eels 
2 0 3 1 
36. A=|]1 1 1|, *=/4 
3 -1 2 2 
—2 0 3 4 
37. A= 1 1 -2], ¥=1}]3 
4 2 -1 1 
_ 2 —1 a X1 
sae [2 2]. v= [2] 


39. 


40. 


41. 


42. 


43. 


44. 


2.2 Matrix Multiplication 


R 
N 
w 
x< 

a 


A=|]1 0 2], K=|]x 
23 1 X3 
ee ee ree 3 
LetA = E 5 |: Find andA>. 
we [Be Oh ee) od 3 
LetA = Ee 3 |: Find andA®. 
-1 00 
LettA = | 0 3 OJ. Find A? and 
0 05 
A’. 
0 1 0 
LettA=|0O 0 1)]. Find A? and A?. 
1 0 0 
001 
LettA=|0O O OJ]. Find A? and A?. 
0 1 0 
. In the text we state that (A + B)? # 


A’? +2AB+B. We investigate that claim 
here. 


(a) LetA = ‘ | ana let B = 


—-3 -2 


—-5 -5 

& 1 |. Compute A + 8 

(b) Find (A + B)* by using your an- 
swer from (a). 


(c) Compute A? + 2AB + B?. 


(d) Are the results from (a) and (b) 
the same? 


(e) Carefully expand the expression 
(A+B) = (A+ B)(A + B) 
and show why this is not equal to 
A’ + 2AB + B’. 
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2.3. Visualizing Matrix Arithmetic in 2D 


AS YOU READ 


1. T/F: Two vectors with the same length and direction are equal even if they start 
from different places. 


2. One can visualize vector addition using what law? 
3. T/F: Multiplying a vector by 2 doubles its length. 
4. What do mathematicians do? 


5. T/F: Multiplying a vector by a matrix always changes its length and direction. 


When we first learned about adding numbers together, it was useful to picture a 
number line: 2+ 3 = 5 could be pictured by starting at 0, going out 2 tick marks, then 
another 3, and then realizing that we moved 5 tick marks from O. Similar visualizations 
helped us understand what 2 — 3 meant and what 2 x 3 meant. 

We now investigate a way to picture matrix arithmetic — in particular, operations 
involving column vectors. This not only will help us better understand the arithmetic 
operations, it will open the door to a great wealth of interesting study. Visualizing 
matrix arithmetic has a wide variety of applications, the most common being computer 
graphics. While we often think of these graphics in terms of video games, there are 
numerous other important applications. For example, chemists and biologists often 
use computer models to “visualize” complex molecules to “see” how they interact with 
other molecules. 

We will start with vectors in two dimensions (2D) — that is, vectors with only two 
entries. We assume the reader is familiar with the Cartesian plane, that is, plotting 
points and graphing functions on “the x-y plane.” We graph vectors in a manner very 
similar to plotting points. Given the vector 


(i 


we draw x by drawing an arrow whose tip is 1 unit to the right and 2 units up from its 
ee 7 
origin. 


Figure 2.1: Various drawings of x 


7To help reduce clutter, in all figures each tick mark represents one unit. 


2.3 Visualizing Matrix Arithmetic in 2D 


When drawing vectors, we do not specify where you start drawing; all we specify 
is where the tip lies based on where we started. Figure 2.1 shows vector x drawn 3 
ways. In some ways, the “most common” way to draw a vector has the arrow start at 
the origin, but this is by no means the only way of drawing the vector. 

Let’s practice this concept by drawing various vectors from given starting points. 


Example 31 Let 


[A] 8] ~ [2] 


Draw xX starting from the point (0, —1); draw y starting from the point (—1, —1), and 
draw Z starting from the point (2, —1). 


SOLUTION To draw x, start at the point (0, —1) as directed, then move to the 
right one unit and down one unit and draw the tip. Thus the arrow “points” from 
(0, —1) to (1, —2). 

To draw y, we are told to start and the point (—1, —1). We draw the tip by moving 
to the right 2 units and up 3 units; hence y points from (—1, —1) to (1,2). 

To draw Z, we start at (2, —1) and draw the tip 3 units to the left and 2 units up; Z 
points from (2, —1) to (—1, 1). 

Each vector is drawn as shown in Figure 2.2. 


Figure 2.2: Drawing vectors x, y and Z in Example 31 


How does one draw the zero vector, 0 = ?8 Following our basic procedure, 


0 
0 
we start by going 0 units in the x direction, followed by 0 units in the y direction. In 
other words, we don’t go anywhere. In general, we don’t actually draw 0. At best, 
one can draw a dark circle at the origin to convey the idea that 0, when starting at the 
origin, points to the origin. 

In section 2.1 we learned about matrix arithmetic operations: matrix addition and 
scalar multiplication. Let’s investigate how we can “draw” these operations. 


8 Vectors are just special types of matrices. The zero vector, 0,isa special type of zero matrix, 0. It helps 
to distinguish the two by using different notation. 
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Vector Addition 


Given two vectors X and y, how do we draw the vector xX + y? Let’s look at this in 
the context of an example, then study the result. 


Example 32 Let 
s 1 =~ |3 
z=(1| and y=|i]. 
Sketch xX, Yand xX + ¥. 
SOLUTION A starting point for drawing each vector was not given; by default, 


we'll start at the origin. (This is in many ways nice; this means that the vector H 


“points” to the point (3,1).) We first compute xX + y: 


cere ll+ (a= [2 


Sketching each gives the picture in Figure 2.3. 


Figure 2.3: Adding vectors X and y in Example 32 


This example is pretty basic; we were given two vectors, told to add them together, 
then sketch all three vectors. Our job now is to go back and try to see a relationship 
between the drawings of x, y and x + y. Do you see any? 

Here is one way of interpreting the adding of x to y. Regardless of where we start, 
we draw x. Now, from the tip of x, draw y¥. The vector X + y is the vector found 
by drawing an arrow from the origin of xX to the tip of y. Likewise, we could start by 
drawing jy. Then, starting from the tip of ¥, we can draw x. Finally, draw xX + y by 
drawing the vector that starts at the origin of y and ends at the tip of x. 

The picture in Figure 2.4 illustrates this. The gray vectors demonstrate drawing the 
second vector from the tip of the first; we draw the vector x + y dashed to set it apart 
from the rest. We also lightly filled the parallelogram whose opposing sides are the 


vectors xX and y. This highlights what is known as the 


x! 


2.3 Visualizing Matrix Arithmetic in 2D 


Parallelogram Law. 


X+y 


x 


Figure 2.4: Adding vectors graphically using the Parallelogram Law 


Key Idea 5 Parallelogram Law 


To draw the vector xX + y, one can draw the parallelogram 


with X and ¥ as its sides. The 
vertex where x and y originate 
meet is the vector x + y. 


vector that points from the 
to the vertex where x and y 


Knowing all of this allows us to draw the sum of two vectors without knowing 
specifically what the vectors are, as we demonstrate in the following example. 


Example 33 
vector X + y. 


SOLUTION 


<1 
I 


Consider the vectors X and y as drawn in Figure 2.5. Sketch the 


Figure 2.5: Vectors x and y in Example 33 


We'll apply the Parallelogram Law, as given in Key Idea 5. As before, we draw X + 


dashed to set it apart. The result is given in Figure 2.6. 


X+y 


Figure 2.6: Vectors X, Yand x + y 


in Example 33 
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Scalar Multiplication 


After learning about matrix addition, we learned about scalar multiplication. We 
apply that concept now to vectors and see how this is represented graphically. 


Example 34 Let 


Sketch x, y¥, 3X and —1y. 


SOLUTION We begin by computing 3x and —y: 


= 3 “4 2 
ax= (3 and -5=| 2]. 


All four vectors are sketched in Figure 2.7. 


Figure 2.7: Vectors xX, y, 3X and —y in Example 34 


As we often do, let us look at the previous example and see what we can learn from 
it. We can see that xX and 3x point in the same direction (they lie on the same line), but 
3x is just longer than x. (In fact, it looks like 3x is 3 times longer than xX. Is it? How do 
we measure length?) 

We also see that y and —/ seem to have the same length and lie on the same line, 
but point in the opposite direction. 

A vector inherently conveys two pieces of information: length and direction. Mul- 
tiplying a vector by a positive scalar c stretches the vectors by a factor of c; multiplying 
by a negative scalar c both stretches the vector and makes it point in the opposite 
direction. 

Knowing this, we can sketch scalar multiples of vectors without knowing specifi- 
cally what they are, as we do in the following example. 


Example 35 Let vectors X and y be as in Figure 2.8. Draw 3x, —2x, and Sy. 


2.3 Visualizing Matrix Arithmetic in 2D 


! 


x1 


Figure 2.8: Vectors x and y in Example 35 


SOLUTION To draw 3x, we draw a vector in the same direction as x, but 3 times 
as long. To draw —2x, we draw a vector twice as long as xX in the opposite direction; 
to draw SV, we draw a vector half the length of y in the same direction as y. We again 
use the default of drawing all the vectors starting at the origin. All of this is shown in 
Figure 2.9. 


2x 


Figure 2.9: Vectors x, y, 3X, —2x and 5x in Example 35 


Vector Subtraction 


The final basic operation to consider between two vectors is that of vector sub- 
traction: given vectors x and y, how do we draw x — y? 

If we know explicitly what X and y are, we can simply compute what x — yis and 
then draw it. We can also think in terms of vector addition and scalar multiplication: 
we can add the vectors x + (—1)y. That is, we can draw x and draw —y, then add them 
as we did in Example 33. This is especially useful we don’t know explicitly what x and 
y are. 


Example 36 Let vectors X and y be as in Figure 2.10. Draw x — y. 


y 


<1 


Figure 2.10: Vectors x and y in Example 36 
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SOLUTION To draw x — y, we will first draw —y and then apply the Parallelogram 
Law to add x to —y. See Figure 2.11. 


Figure 2.11: Vectors x, ¥ and X — yin Example 36 


In Figure 2.12, we redraw Figure 2.11 from Example 36 but remove the gray vectors 
that tend to add clutter, and we redraw the vector x — y dotted so that it starts from the 
tip of y.° Note that the dotted version of x — y points from y to x. This is a “shortcut” 
to drawing xX — y; simply draw the vector that starts at the tip of y and ends at the tip 
of x. This is important so we make it a Key Idea. 


y 


Figure 2.12: Redrawing vector X — y 


Key Idea 6 Vector Subtraction 


To draw the vector x — y, draw X and y'so that they have the 
same origin. The vector X — y is the vector that starts from 
the tip of y and points to the tip of x. 


Let’s practice this once more with a quick example. 
Example 37 Let X and y be as in Figure ?? (a). Draw X — y. 


SOLUTION We simply apply Key Idea 6: we draw an arrow from y to x. We do 
so in Figure 2.13; x — y is dashed. 


72 °Remember that we can draw vectors starting from anywhere. 
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<I 
<1 


x1 
<1 


(a) (b) 


Figure 2.13: Vectors x, ¥and X — yin Example 37 


Vector Length 


When we discussed scalar multiplication, we made reference to a fundamental 
question: How do we measure the length of a vector? Basic geometry gives us an 
answer in the two dimensional case that we are dealing with right now, and later we 
can extend these ideas to higher dimensions. 

Consider Figure 2.14. A vector x is drawn in black, and dashed and dotted lines 
have been drawn to make it the hypotenuse of a right triangle. 


Figure 2.14: Measuring the length of a vector 


It is easy to see that the dashed line has length 4 and the dotted line has length 3. 
We'll let c denote the length of x: according to the Pythagorean Theorem, 42 +37 = c?. 
Thus c* = 25 and we quickly deduce that c = 5. 

Notice that in our figure, X goes to the right 4 units and then up 3 units. In other 
words, we can write 

-(3] 
3|° 


We learned above that the length of x is \/4? + 37.1 This hints at a basic calculation 
that works for all vectors xX, and we define the length of a vector according to this rule. 


10Remember that \/4* + 37 #443! 
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Let 


= x. 
Fal ls 
X2 


The /ength of xX, denoted ||x’|, is 


I = oe +8. 


Example 38 Find the length of each of the vectors given below. 
ve 1 wa 2 Za 6 ue 3 
sae | = |llog 8 pea 
SOLUTION We apply Definition 16 to each vector. 


||xa|| = V1? +12 = V2. 
P|] = 2 + (-3)? = V4B. 
||x3|| = v.67 + .87 = 36+ .64=1. 


\|xa|| = V3? +0=3. 


Now that we know how to compute the length of a vector, let’s revisit a statement 
we made as we explored Examples 34 and 35: “Multiplying a vector by a positive scalar 
c stretches the vectors by a factor ofc...” At that time, we did not know how to mea- 
sure the length of a vector, so our statement was unfounded. In the following example, 
we will confirm the truth of our previous statement. 


Example 39 Letx = aah Compute ||x||, ||3x]|, || — 2x]||, and ||cx||, where c 
is a scalar. 

SOLUTION We apply Definition 16 to each of the vectors. 

|x|] = /44+1= V5. 

Before computing the length of ||3x||, we note that 3x = | i : 


||3xX]| = /36 +9 = V45 = 3/5 = 3] |x|. 
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Before computing the length of || — 2x||, we note that —2x = | : 


|| — 2x|| = 16 + 4 = V20 = 2/5 =2||x'||. 


: > > 2c 
Finally, to compute ||cx/|, we note that cx = | . Thus: 


||cx]| = \/(2c)2 + (—c)? = V4e2 + 2 = V5e2 = |c|V5. 


This last line is true because the square root of any number squared is the absolute 
value of that number (for example, ,/(—3) = 3). 


The last computation of our example is the most important one. It shows that, 
in general, multiplying a vector x by a scalar c stretches x by a factor of |c| (and the 
direction will change if c is negative). This is important so we’ll make it a Theorem. 


Theorem 4 Vector Length and Scalar Multiplication 


Let X be a vector and let c be a scalar. Then the length of cx 
is 


[lex] = [el - [|x]. 


Matrix — Vector Multiplication 


The last arithmetic operation to consider visualizing is matrix multiplication. Specif- 
ically, we want to visualize the result of multiplying a vector by a matrix. In order to 
multiply a 2D vector by a matrix and get a 2D vector back, our matrix must be a square, 
2 x 2 matrix. 


We’ll start with an example. Given a matrix A and several vectors, we’ll graph the 
vectors before and after they’ve been multiplied by A and see what we learn. 


Example 40 Let A be a matrix, and x, y, and Z be vectors as given below. 


a el eee I a) 


Graph x, y and Z, as well as Ax, Ay and Az. 


SOLUTION 


11We can multiply a 3 x 2 matrix by a 2D vector and get a 3D vector back, and this gives very interesting 
results. See section 5.2. 
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Figure 2.15: Multiplying vectors by a matrix in Example 40. 


It is straightforward to compute: 


- 5 4 3 - —1 
a= |, a= |7]. and a= |]. 


The vectors are sketched in Figure 2.15 


There are several things to notice. When each vector is multiplied by A, the result 
is a vector with a different length (in this example, always longer), and in two of the 
cases (for y and Z), the resulting vector points in a different direction. 

This isn’t surprising. In the previous section we learned about matrix multiplica- 
tion, which is a strange and seemingly unpredictable operation. Would you expect to 
see some sort of immediately recognizable pattern appear from multiplying a matrix 
and a vector?” In fact, the surprising thing from the example is that x and Ax point 
in the same direction! Why does the direction of X not change after multiplication by 
A? (We'll answer this in Section 4.1 when we learn about something called “eigenvec- 
tors.”) 

Different matrices act on vectors in different ways.? Some always increase the 
length of a vector through multiplication, others always decrease the length, others 
increase the length of some vectors and decrease the length of others, and others still 
don’t change the length at all. A similar statement can be made about how matrices 
affect the direction of vectors through multiplication: some change every vector’s di- 
rection, some change “most” vector’s direction but leave some the same, and others 
still don’t change the direction of any vector. 

How do we set about studying how matrix multiplication affects vectors? We could 
just create lots of different matrices and lots of different vectors, multiply, then graph, 
but this would be a lot of work with very little useful result. It would be too hard to 
find a pattern of behavior in this.“ 


12This is a rhetorical question; the expected answer is “No.” 
13 That’s one reason we call them “different.” 
14Remember, that’s what mathematicians do. We look for patterns. 


2.3 Visualizing Matrix Arithmetic in 2D 


Instead, we’ll begin by using a technique we’ve employed often in the past. We 
have a “new” operation; let’s explore how it behaves with “old” operations. Specifi- 
cally, we know how to sketch vector addition. What happens when we throw matrix 
multiplication into the mix? Let’s try an example. 


Example 41 Let A be a matrix and X and y be vectors as given below. 
1 1 = 2 = —1 
Sketch x + y, Ax, Ay, and A(x + y). 
SOLUTION It is pretty straightforward to compute: 
er 1 e 3 = 0 py be 3 
X+y= B > AX= la |: Ay = H , AX +y) = El : 
In Figure 2.16, we have graphed the above vectors and have included dashed gray 
vectors to highlight the additive nature of x+y and A(x + ¥). Does anything strike you 
as interesting? 


y A(x + y) 


Figure 2.16: Vector addition and matrix multiplication in Example 41. 


Let’s not focus on things which don’t matter right now: let’s not focus on how long 
certain vectors became, nor necessarily how their direction changed. Rather, think 
about how matrix multiplication interacted with the vector addition. 

In some sense, we started with three vectors, x, ¥, and xX + y. This last vector is 
special; it is the sum of the previous two. Now, multiply all three by A. What happens? 
We get three new vectors, but the significant thing is this: the last vector is still the 
sum of the previous two! (We emphasize this by drawing dotted vectors to represent 
part of the Parallelogram Law.) 
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Of course, we knew this already: we already knew that Ax + Ay = A(x + y), for 
this is just the Distributive Property. However, now we get to see this graphically. 


In Section 5.1 we’ll study in greater depth how matrix multiplication affects vec- 
tors and the whole Cartesian plane. For now, we’ll settle for simple practice: given a 
matrix and some vectors, we’ll multiply and graph. Let’s do one more example. 
Example 42 Let A, x, y, and Z be as given below. 

Pe aes ae ed. _ [-1 z_|4 
Sa. ae Sa Sh ge el 
Graph x, y and Z, as well as Ax, Ay and Az. 


SOLUTION 


Ay Y 
Figure 2.17: Multiplying vectors by a matrix in Example 42. 


It is straightforward to compute: 


- 0 _ —2 - 3 
a= [5], a=|73). and “z= |3]. 


The vectors are sketched in Figure 2.17. 

These results are interesting. While we won’t explore them in great detail here, 
notice how x got sent to the zero vector. Notice also that Ax, Ay and AZ are allina 
line (as well as x!). Why is that? Are x, y and Z just special vectors, or would any other 
vector get sent to the same line when multiplied by A??? 


This section has focused on vectors in two dimensions. Later on in this book, we’ll 
extend these ideas into three dimensions (3D). 

In the next section we’ll take a new idea (matrix multiplication) and apply it to an 
old idea (solving systems of linear equations). This will allow us to view an old idea in 
a new way — and we'll even get to “visualize” it. 


15Don’t just sit there, try it out! 
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2.3 Visualizing Matrix Arithmetic in 2D 


In Exercises 1 — 4, vectors X and y are given. 
Sketch x, y, x+y, and X — yon the same Carte- 


sian axes. 
[i]. 
x= ele 
le 
i 5.7 


In Exercises 5 — 8, vectors X and / are drawn. 
Sketch 2x, —y, X + y, and X — yon the same 


Cartesian axes. 


y 
x 
5. i 
1 
6. 
A 
x 

8. 1 


y 


In Exercises 9 — 12, a vector X and a scalar 
a are given. Using Definition 16, compute 


the lengths of X and ax, then compare these 


lengths. 
9.xXx= [i]e=3 
10. X= [F|a=-2 
11. X= |e )e=4 
12,.5.= cg a 


13. Four pairs of vectors xX and ¥ are given 
below. For each pair, compute ||x/|, 
|ly||, and ||x + y||. Use this information 
to answer: Is it always, sometimes, or 
never true that ||x|| + ||¥]| = ||x + y||? 
If it always or never true, explain why. 
If it is sometimes true, explain when it 
is true. 


(d) X= 


Aige) 


In Exercises 14 — 17, a matrix A is given. 
Sketch x, ¥, AX and Ay on the same Cartesian 
axes, where 


rae H and 7 = Bak 


14. A= E el 
15. A= be | 
16. A= Fi | 

aa ie 5 
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2.4 Vector Solutions to Linear Systems 


AS YOU READ 


1. T/F: The equation Ax = b is just another way of writing a system of linear equa- 
tions. 


2. T/F: In solving AX = 0, if there are 3 free variables, then the solution will be 
“oulled apart” into 3 vectors. 


3. T/F: Ahomogeneous system of linear equations is one in which all of the coeffi- 
cients are O. 


4. Whether or not the equation Ax = b has a solution depends on an intrinsic 
property of 


The first chapter of this text was spent finding solutions to systems of linear equa- 
tions. We have spent the first two sections of this chapter learning operations that 
can be performed with matrices. One may have wondered “Are the ideas of the first 
chapter related to what we have been doing recently?” The answer is yes, these ideas 
are related. This section begins to show that relationship. 

We have often hearkened back to previous algebra experience to help understand 
matrix algebra concepts. We do that again here. Consider the equation ax = b, where 
a = 3 and b = 6. If we asked one to “solve for x,” what exactly would we be asking? 
We would want to find a number, which we call x, where a times x gives b; in this case, 
it is a number, when multiplied by 3, returns 6. 

Now we consider matrix algebra expressions. We’ll eventually consider solving 
equations like AX = B, where we know what the matrices A and B are and we want to 
find the matrix X. For now, we’ll only consider equations of the type Ax = b, where 
we know the matrix A and the vector b. We will want to find what vector x satisfies 
this equation; we want to “solve for x.” 

To help understand what this is asking, we’ll consider an example. Let 


1 1 1 2 Xi 
A=|1 -1 2], b=]-3 and x= | x2 
2 0 1 1 X3 


(We don’t know what x is, so we have to represent it’s entries with the variables x1, x2 
and x3.) Let’s “solve for x,” given the equation Ax = b. 
We can multiply out the left hand side of this equation. We find that 


X, + X2 + x3 
AX = | xX, — xX + 2x3 


2X1 + X3 


Be sure to note that the product is just a vector; it has just one column. 


Since Ax is equal to b, we have 


X1 + X2 + X3 
X1 — Xo + 2x3 
2X1 + X3 


2.4 Vector Solutions to Linear Systems 


Knowing that two vectors are equal only when their corresponding entries are equal, 


we know 


Xy +X). + X3 = 


X1 — X2 + 2x3 


2X1 + X3 


2 
—3 
1. 


This should look familiar; it is a system of linear equations! Given the matrix-vector 
equation AX = b, we can recognize A as the coefficient matrix from a linear system 
and b as the vector of the constants from the linear system. To solve a matrix-vector 
equation (and the corresponding linear system), we simply augment the matrix A with 
the vector b, put this matrix into reduced row echelon form, and interpret the results. 

We convert the above linear system into an augmented matrix and find the reduced 


row echelon form: 


1 1 1 2 
, 
1 -1 2 -3 rref 
2 0 1 41 
This tells us that x; = 1, x7 = 2 and x3 = —1,s0 
1 
x= | 2 
—1 


We should check our work; multiply out Ax and verify that we indeed get b: 


1 11 1 2 
1 -1 2 2 does equal —3 
2 0 1 —1 1 


We should practice. 


Example 43 
1 2 3 
A=|-1 2 1 
1 1 O 

SOLUTION 


Solve the equation Ax = b for X where 


5 


and —1 


2 


The solution is rather straightforward, even though we did a lot of 


work before to find the answer. Form the augmented matrix [A b| and interpret its 


reduced row echelon form. 


POO 
PON 
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In previous sections we were fine stating that the result as 
X=2, x2=0, x3=1, 


but we were asked to find x; therefore, we state the solution as 


<1 
lI 
FON 


This probably seems all well and good. While asking one to solve the equation 
Ax = b for xX seems like a new problem, in reality it is just asking that we solve a 
system of linear equations. Our variables x,, etc., appear not individually but as the 
entries of our vector x. We are simply writing an old problem in a new way. 

In line with this new way of writing the problem, we have a new way of writing 
the solution. Instead of listing, individually, the values of the unknowns, we simply list 
them as the elements of our vector x. 

These are important ideas, so we state the basic principle once more: solving the 
equation Ax = b for X is the same thing as solving a linear system of equations. Equiv- 
alently, any system of linear equations can be written in the form AX = b for some 
matrix A and vector b. 

Since these ideas are equivalent, we'll refer to AX = b both as a matrix—vector 
equation and as a system of linear equations: they are the same thing. 

We’ve seen two examples illustrating this idea so far, and in both cases the linear 
system had exactly one solution. We know from Theorem 1 that any linear system has 
either one solution, infinite solutions, or no solution. So how does our new method of 
writing a solution work with infinite solutions and no solutions? 

Certainly, if Ax = b has no solution, we simply say that the linear system has no 
solution. There isn’t anything special to write. So the only other option to consider is 
the case where we have infinite solutions. We'll learn how to handle these situations 
through examples. 


Example 44 Solve the linear system Ax = 0 for X and write the solution in 
vector form, where 
1 2 =~ 0 
A= E ;| and O0= lel: 
SOLUTION (Note: we didn’t really need to specify that 


(3 


but we did just to eliminate any uncertainty.) 
To solve this system, put the augmented matrix into reduced row echelon form, 
which we do below. 


2.4 Vector Solutions to Linear Systems 


We interpret the reduced row echelon form of this matrix to write the solution as 


X, = —2x? 


X2 is free. 


We are not done; we need to write the solution in vector form, for our solution is the 
vector x. Recall that 
e x 
X2 


From above we know that x; = —2x, so we replace the x, in X with —2x>. This gives 


our solution as 
= —2x 
x= a 
x2 


Now we pull the x2 out of the vector (it is just a scalar) and write x as 


make 
xX =xX?2 1 : 


For reasons that will become more clear later, set 
_ —2 
= |g 


x= XoV. 


Thus our solution can be written as 


Recall that since our system was consistent and had a free variable, we have infinite 
solutions. This form of the solution highlights this fact; pick any value for x2 and we 
get a different solution. 

For instance, by setting x. = —1, 0, and 5, we get the solutions 


ga |= ° and | 2° 
~ f—1)? 0}’ 5 , 
respectively. 


We should check our work; multiply each of the above vectors by A to see if we 
indeed get 0. 

We have officially solved this problem; we have found the solution to AX = Oand 
written it properly. One final thing we will do here is graph the solution, using our skills 
learned in the previous section. 

mn] 
xX =X? 1 . 


Our solution is 
. , a —2|. . 
This means that any scalar multiply of the vector v = 1 is a solution; we know 


how to sketch the scalar multiples of /. This is done in Figure 2.18. 
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: 
Figure 2.18: The solution, as a line, to AX = 0 in Example 44. 


Here vector V is drawn as well as the line that goes through the origin in the direc- 
tion of v. Any vector along this line is a solution. So in some sense, we can say that the 
solution to AX = Ois a line. 


Let’s practice this again. 


Example 45 Solve the linear system AX = 0 and write the solution in vector 
form, where 
2 -3 
A= . 
SOLUTION Again, to solve this problem, we form the proper augmented ma- 


trix and we put it into reduced row echelon form, which we do below. 


[2 ao] om [o 0" | 


-> 3 0] ™ lo o a 
We interpret the reduced row echelon form of this matrix to find that 


xy = 3/2x2 


X2 is free. 


> x. 
x= |]. 
X2 


Since x, = 3/2x2, we replace x, in X with 3/2x): 


le 


Now we pull out the x2 and write the solution as 


z=m| 7]. 


As before, 


1 


2.4 Vector Solutions to Linear Systems 


As before, let’s set 


so we can write our solution as 
X = XoVv. 


Again, we have infinite solutions; any choice of x gives us one of these solutions. 
For instance, picking x2 = 2 gives the solution 


(This is a particularly nice solution, since there are no fractions. . .) 
As in the previous example, our solutions are multiples of a vector, and hence we 
can graph this, as done in Figure 2.19. 


y 


A 


vy 


Figure 2.19: The solution, as a line, to AX = Oin Example 45. 


Let’s practice some more; this time, we won’t solve a system of the form AX = 0, 
but instead Ax = b, for some vector b. 


Example 46 Solve the linear system AX = b, where 
1 2 ~ 3 
A=|) ;| and b=(é | 


SOLUTION (Note that this is the same matrix A that we used in Example 44. 
This will be important later.) 

Our methodology is the same as before; we form the augmented matrix and put it 
into reduced row echelon form. 


123) [23 
a i Gg Leo 
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Interpreting this reduced row echelon form, we find that 


X, = 3- 2x 


X2 is free. 


“ Xt 
x= 5 
9) 


and we replace x; with 3 — 2x3, giving 


zu ial 
X2 


This solution is different than what we’ve seen in the past two examples; we can’t 
simply pull out a x2 since there is a 3 in the first entry. Using the properties of matrix 
addition, we can “pull apart” this vector and write it as the sum of two vectors: one 
which contains only constants, and one that contains only “x2 stuff.” We do this below. 


-[ 
2] 
7 A aeiyg Fak 


Once again, let’s give names to the different component vectors of this solution 
(we are getting near the explanation of why we are doing this). Let 


[J = [2] 


We can then write our solution in the form 


Again, 


X = Xp + XV. 


We still have infinite solutions; by picking a value for x2 we get one of these solu- 
tions. For instance, by letting x2 = —1, O, or 2, we get the solutions 


A} [eo] o [34] 


We have officially solved the problem; we have solved the equation AX = b for x 
and have written the solution in vector form. As an additional visual aid, we will graph 
this solution. 

Each vector in the solution can be written as the sum of two vectors: x, and a 
multiple of V. In Figure 2.20, xp is graphed and Vis graphed with its origin starting at 
the tip of xp. Finally, a line is drawn in the direction of V from the tip of xp; any vector 
pointing to any point on this line is a solution to AX = b. 


2.4 Vector Solutions to Linear Systems 


{ 
Figure 2.20: The solution, as a line, to AX = bin Example 46. 


The previous examples illustrate some important concepts. One is that we can 
“see” the solution to a system of linear equations in a new way. Before, when we had 
infinite solutions, we knew we could arbitrarily pick values for our free variables and 
get different solutions. We knew this to be true, and we even practiced it, but the 
result was not very “tangible.” Now, we can view our solution as a vector; by picking 
different values for our free variables, we see this as multiplying certain important 
vectors by a scalar which gives a different solution. 

Another important concept that these examples demonstrate comes from the fact 
that Examples 44 and 46 were only “slightly different” and hence had only “slightly 
different” answers. Both solutions had 


in them; in Example 46 the solution also had another vector added to this. Was this 
coincidence, or is there a definite pattern here? 
Of course there is a pattern! Now ... what exactly is it? First, we define a term. 


Definition 17 Homogeneous Linear System of Equations 


A system of linear equations is homogeneous if the con- 
stants in each equation are zero. 


Note: a homogeneous system of equations can be written 
in vector form as AX = O. 


The term homogeneous comes from two Greek words; homo meaning “same” and 
genus meaning “type.” A homogeneous system of equations is a system in which each 
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equation is of the same type — all constants are 0. Notice that the system of equations 
in Examples 44 and 46 are homogeneous. 

Note that AO = 0; that is, if we set x = 0, we have a solution to a homogeneous set 
of equations. This fact is important; the zero vector is always a solution to a homoge- 
neous linear system. Therefore a homogeneous system is always consistent; we need 
only to determine whether we have exactly one solution (just 0) or infinite solutions. 
This idea is important so we give it it’s own box. 


Key Idea 7 Homogeneous Systems and Consistency 


All homogeneous linear systems are consistent. 


How do we determine if we have exactly one or infinite solutions? Recall Key Idea 
2: if the solution has any free variables, then it will have infinite solutions. How can 
we tell if the system has free variables? Form the augmented matrix [A 0], put it 
into reduced row echelon form, and interpret the result. 

It may seem that we’ve brought up a new question, “When does Ax = O have ex- 
actly one or infinite solutions?” only to answer with “Look at the reduced row echelon 
form of A and interpret the results, just as always.” Why bring up a new question if the 
answer is an old one? 


While the new question has an old solution, it does lead to a great idea. Let’s 
refresh our memory; earlier we solved two linear systems, 


where 
1 2 - 3 
a=|3 ‘| and b=(é|- 


The solution to the first system of equations, AX = 0, is 


aca 
X =X? 1 


and the solution to the second set of equations, AX = b, is 


a +x a 
~ 10 ei 4. | 
for all values of x>. 


Recalling our notation used earlier, set 


- 3 - —2 
X =, and let ze |. 


2.4 Vector Solutions to Linear Systems 


Thus our solution to the linear system AX = bis 


> 


= = 
X = Xp + XV. 


Let us see how exactly this solution works; let’s see why Ax equals b. Multiply Ax: 


AX = A(Xp + X2V) 
= Ax, + A(x2V) 
= Ax + X2(AV) 


= Ax, + x20 
= AX, + 0 
= AX, 

=b 


We know that the last line is true, that Ax, = b, since we know that x was a solution 
to Ax = b. The whole point is that x itself is a solution to AX = b, and we could find 
more solutions by adding vectors “that go to zero” when multiplied by A. (The subscript 
p of “x,” is used to denote that this vector is a “particular” solution.) 


Stated in a different way, let’s say that we know two things: that Ax, = b and 
Av = 0. What is A(xp + V)? We can multiply it out: 


and see that A(x, + V/) also equals b. 


So we wonder: does this mean that Ax — b will have infinite solutions? After all, 
if x, and x, + Vare both solutions, don’t we have infinite solutions? 


No. If AX = O has exactly one solution, then V = 0, and x, = X, + V; we only have 
one solution. 


So here is the culmination of all of our fun that started a few pages back. If Vis 
a solution to AX = O and x; is a solution to Ax = b, then x, + ‘is also a solution to 
Ax = b. If Ax = O has infinite solutions, so does AX = b; if Ax = O has only one 
solution, so does AX = b. This culminating idea is of course important enough to be 
stated again. 
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Key Idea 8 Solutions of Consistent Systems 


Let Ax = b be a consistent system of linear equations. 


1. If Ax = 0 has exactly one solution (x = 0), then Ax = 
b has exactly one solution. 


2. If AX = GO has infinite solutions, then AX = b has infi- 
nite solutions. 


A key word in the above statement is consistent. If AX = b is inconsistent (the 
linear system has no solution), then it doesn’t matter how many solutions AX = 0 has; 
Ax = b has no solution. 

Enough fun, enough theory. We need to practice. 


Example 47 Let 
1 -1 1 3 ~ 1 
al 2 4 | ane Flip 


Solve the linear systems Ax = O and Ax = b for X, and write the solutions in vector 
form. 


SOLUTION We'll tackle Ax = 0 first. We form the associated augmented ma- 
trix, put it into reduced row echelon form, and interpret the result. 


1-113 0) 7a |1 9 1 2 «0 
4 2 4 6 0 0 1 0 -1 0 


X1 = —X3 — 2x4 
X2 =X 
x3 is free 


Xz, is free 


To write our solution in vector form, we rewrite x, and xz in X in terms of x3 and xq. 


X1 —X3 — 2X4 
an X2 X4 
X= = 

X3 X3 

X4 Xa 


Finally, we “pull apart” this vector into two vectors, one with the “x3 stuff” and one 
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with the “x, stuff.” 


—X3 — 2X4 
X= - 
X3 
X4 
—X3 —2X4 
0 Xa 
— X3 T 0 
0 Xa 
—1 —2 
0 1 
ea ee ea 
0 1 
= X30 + X4V 


We use U and V simply to give these vectors names (and save some space). 

It is easy to confirm that both U and V are solutions to the linear system Ax = 0. 
(Just multiply Au and Av and see that both are 0.) Since both are solutions to a ho- 
mogeneous system of linear equations, any linear combination of & and V will be a 
solution, too. 


Now let’s tackle Ax = b. Once again we put the associated augmented matrix into 
reduced row echelon form and interpret the results. 


1-113 1] — [101 2 2 
4 2 @4-6 a0) “" lo ao —i-4 


X, = 2 — x3 — 2x, 
X%=1+%, 
X3 is free 


Xq is free 


Writing this solution in vector form gives 


X1 2 — x3 — 2X4 
= x 14x. 
x= 2 = 4 

X3 X3 

Xa X4 


Again, we pull apart this vector, but this time we break it into three vectors: one with 
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“x3” stuff, one with “x,” stuff, and one with just constants. 


xX a! 1 + Xa 
x3 
X4 
1 Te) [e 
1 ay 0 it Xa 
0 X3 0 
0 0 Xa 
2 —1 2 
eee Oo}, 1 
= 0 X3 1 X4 0 
0 0 1 
= Xp + X34 +x4V 
“SY —S—__’ 
particular solution to 
solution homogeneous 


equations Ax = 0 


Note that Ax, = b; by itself, Xp is a solution. To get infinite solutions, we add a bunch 
of stuff that “goes to zero” when we multiply by A; we add the solution to the homo- 
geneous equations. 


Why don’t we graph this solution as we did in the past? Before we had only two 
variables, meaning the solution could be graphed in 2D. Here we have four variables, 
meaning that our solution “lives” in 4D. You can draw this on paper, but it is very con- 
fusing. 


Example 48 Rewrite the linear system 
X, + 2x. — 3x3 + 2x, + 7x, = 2 
3x, + 4%. + 5x3 + 2x, + 3x5 = —4 


as a matrix-vector equation, solve the system using vector notation, and give the so- 
lution to the related homogeneous equations. 


SOLUTION Rewriting the linear system in the form of AX = b, we have that 
X41 
x2 
12 -3 2 7 = ~ 2 
Aa A 5 2 31° X= | x3 and p=|%). 
X4 
Xs 


To solve the system, we put the associated augmented matrix into reduced row eche- 
lon form and interpret the results. 


2.4 Vector Solutions to Linear Systems 


X, = —8 — 11x3 + 2x, + 11x, 
X2 = 5+ 7x3 — 2x4 — 9X5 

X3 is free 

Xz, is free 


Xs is free 


We use this information to write x, again pulling it apart. Since we have three free 
variables and also constants, we’ll need to pull X apart into four separate vectors. 


<1 
II 
x< 
w 


—8 — 11x3 + 2x, + 11xs 
5 + 7x3 — 2x4 — 9X5 


= X3 
X4 
X5 
—8 —11x3 2X4 11x; 
5 7X3 —2Xx4 —9xs 
=) 0 [+] a Poe] Oo. |e) oO 
0 0 X4 0 
0 0 0 Xs 
—8 —11 2 11 
=|] 0 | +x3 1 +xa]! 0 | +x5} 0 
0 0 1 0 
0 0 0 1 
= xX + X3U + xX4V + XSW 
antl -—TIS 
particular solution to homogeneous 


solution equations Ax = 0 


So Xp is a particular solution; Ax, = b. (Multiply it out to verify that this is true.) 
The other vectors, U, V and w, that are multiplied by our free variables x3, x4 and xs, 
are each solutions to the homogeneous equations, Ax = 0. Any linear combination 
of these three vectors, i.e., any vector found by choosing values for x3, xq and xs in 
x3U + x,V¥ + X<W is a solution to Ax = 0. 
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Example 49 Let 


Find the solutions to Ax = b and Ax = O. 


SOLUTION We go through the familiar work of finding the reduced row eche- 
lon form of the appropriate augmented matrix and interpreting the solution. 


12 3) Fe {1 9 -1 
45 6 01 2 


X,=-1 


Xx=2 


gi iges 
x= a : 
X2 2 
This may strike us as a bit odd; we are used to having lots of different vectors in the 
solution. However, in this case, the linear system Ax = b has exactly one solution, and 
we’ve found it. What is the solution to Ax = 0? Since we’ve only found one solution to 
Ax = b, we can conclude from Key Idea 8 the related homogeneous equations Ax = 0 


have only one solution, namely xX = 0. We can write our solution vector x in a form 
similar to our previous examples to highlight this: 


[2 


Thus 


=,/=2] a./¢ 
2 0 

= % + 6 
“MY 7 
particular solution to 
solution Ax =0 


Example 50 Let 
1 $1 > 1 
A= 3 | and b= ke 
Find the solutions to Ax = b and Ax = O. 


SOLUTION To solve AX = b, we put the appropriate augmented matrix into 
reduced row echelon form and interpret the results. 


1-2. a) Se 2. 1.0 
a oo |) SP hee. ig ai 
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We immediately have a problem; we see that the second row tells us that Ox; + 
Ox, = 1, the sign that our system does not have a solution. Thus Ax = b has no 
solution. Of course, this does not mean that Ax = 0 has no solution; it always has a 
solution. 

To find the solution to Ax = 0, we interpret the reduced row echelon form of the 
appropriate augmented matrix. 


110) 7a |1 1 0 
2 2 0 0 0 O 


XxX, = —-X2 


X2 is free 


Thus 


X2 


= X1 
x= 
X2 
X2 


= X2 1 
XU. 


We have no solution to Ax = b, but infinite solutions to Ax = O. 


I 


The previous example may seem to violate the principle of Key Idea 8. After all, 
it seems that having infinite solutions to AX = 0 should imply infinite solutions to 
Ax = b. However, we remind ourselves of the key word in the idea that we observed 
before: consistent. If AX = b is consistent and Ax = O has infinite solutions, then so 
will AX = b. But if Ax = bis not consistent, it does not matter how many solutions 
Ax = O has; Ax = bis still inconsistent. 


This whole section is highlighting a very important concept that we won’t fully un- 
derstand until after two sections, but we get a glimpse of it here. When solving any 
system of linear equations (which we can write as AX = b), whether we have exactly 
one solution, infinite solutions, or no solution depends on an intrinsic property of A. 
We'll find out what that property is soon; in the next section we solve a problem we 
introduced at the beginning of this section, how to solve matrix equations AX = B. 


Exercises 2.4 


In Exercises 1 — 6, a matrix A and vectors b, u 1A- 1 Pal 

and V are given. Verify that and V are both —3 6 |’ 

solutions to the equation AX = b; that is, 5 ol . 21, 10 
show that Ad = AV = b. b= (5)-2=[7]-7-[23] 
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pl 2a pas [Ot e 
[Zh e=[S}ee [5 
1 0 
=|) oh 
re a ale ce 
[oh a=[} = [s5 
1 0 
=|) of 
> [-3] ._ [-3] -_ [-3 
pee al 
0-3. +133 
sa=| S, 2 +3 | 
re 
Oil a 4 
b= [5 )-2- 42: |" 
0 
9 
ges, ode 
= 50 
12 
O <3 +1 43 
a=| 9 2 -3 5] 
=17 
> [48] ._ | -16 
ee |" 
0 
-8 
sey 28 
~ | 0 
12 


a! 


In Exercises 7 —9, a matrix A and vectors b, 
and (are given. Verify that Ad = 0, Av = 
and A(u+V) =b. 


u 
b 


O27 

GAs or OR ea 

er 
di 1 1 
b=]1],@=—]1],v=] 1 
1 0 4 
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4 2 2 
b=|-3|,d=|2],7=]3 
1 0 0 
20 0 
9.A=|0 1 -3], 
B. hag 
2 0 1 
b=|-4/,d=|6],¥=|-1 
= 2 1 


In Exercises 10 — 24, a matrix A and vector b 
are given. 


(a) Solve the equation Ax = 0. 
(b) Solve the equation Ax = b. 
In each of the above, be sure to write your an- 
swer in vector format. Also, when possible, 


give 2 particular solutions to each equation. 


0. a=|° S| =| | 
ay ie eat 4| 
12. A= io 7 |,8= il 
13. A=], Seal | 
a=] 2, 2) e=[4] 
1s. a=| 74 i 5 |= [=a 
16. a= |} eel 
v.a=|3 = 3) 8= [=] 
8. =|? : Sale| 
9. a= |} ; ie zal 
[9 
aml ot oa sh 
[3 


2.5 Solving Matrix Equations AX = B 


21. A= 


In Exercises 25 — 28, a matrix A and vector b 
are given. Solve the equation AX = b, write 
the solution in vector format, and sketch the 
solution as the appropriate line on the Carte- 


| 
| 
: 
- E spare 
| 
| 
| 


22. A= 


4 eee oe 0 
25. A= b= 
-1 3 1 -3 4 |—-1 2, lo 
23. A=] 3 -3 -1 1 -4}], 7 
=e 2 ee ee. =| 4 el ps8 
26.A=| |) 4 )e-| 3 
1 L 
b=| 1 - 
2 -5S]> f1 
—5 aga ela 
—-4 -2 -1 4 90 
24.A=]5 -4 3 -1 11], {2 -5] 7 fo 
4 -5 3 1 -4 = | —4 a) E= i 


2.5 Solving Matrix Equations AX = B 


AS YOU READ 


1. T/F: To solve the matrix equation AX = B, put the matrix | AX] into reduced 
row echelon form and interpret the result properly. 


2. T/F: The first column of a matrix product AB is A times the first column of B. 


3. Give two reasons why one might solve for the columns of X in the equation AX=B 
separately. 


We began last section talking about solving numerical equations like ax = b for x. 
We mentioned that solving matrix equations of the form AX = B is of interest, but we 
first learned how to solve the related, but simpler, equations Ax = b. In this section 
we will learn how to solve the general matrix equation AX = B for X. 

We will start by considering the best case scenario when solving Ax = b; that is, 
when A is square and we have exactly one solution. For instance, suppose we want to 


solve Ax — b where 
1 1 > 0 
a=|) ‘| and b=(¢]. 
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We know how to solve this; put the appropriate matrix into reduced row echelon form 
and interpret the result. 


01 -1 


z=| 4]. 


Written in a more general form, we found our solution by forming the augmented 
matrix 


pismo 4 


We read from this that 


[A 6] 
and interpreting its reduced row echelon form: 
a5 => - 
[A b] rref [1 x] 
Notice that when the reduced row echelon form of A is the identity matrix / we have 
exactly one solution. This, again, is the best case scenario. 
We apply the same general technique to solving the matrix equation AX = B for X. 
We'll assume that A is a square matrix (B need not be) and we’ll form the augmented 


matrix 
[A B]. 


Putting this matrix into reduced row echelon form will give us X, much like we found x 
before. 


— 
rref 


[AB] wet [1 x] 


As long as the reduced row echelon form of A is the identity matrix, this technique 
works great. After a few examples, we’ll discuss why this technique works, and we'll 
also talk just a little bit about what happens when the reduced row echelon form of A 
is not the identity matrix. 

First, some examples. 


Example 51 Solve the matrix equation AX = B where 
1 -1 —-8 -13 1 
aa E 3 | ie eee E ~17 Ale 
SOLUTION To solve AX = B for X, we form the proper augmented matrix, put 


it into reduced row echelon form, and interpret the result. 


1-1 -8 -13 1) Fe |1 9 1 -7 3 
5 3 32 -17 21 019 6 2 


We read from the reduced row echelon form of the matrix that 


1-7 3 
x=[5 6 a 


2.5 Solving Matrix Equations AX = B 


We can easily check to see if our answer is correct by multiplying AX. 


Example 52 Solve the matrix equation AX = B where 
1 O 2 -1 2 
A= |0 -1 -2 and B=} 2 -6 
2 -1 O 2 —-4 
SOLUTION To solve, let’s again form the augmented matrix 
[A B], 


put it into reduced row echelon form, and interpret the result. 


io S —t 2 100 1 0 
0 -1 -2 2 -6| mee JO 10 0 4 
2 <1 OG 2 =A GO t ot 4 


We see from this that 


Why does this work? To see the answer, let’s define five matrices. 


t 2) « Nt) 2 Jat) wis ft =4 5 
e=[3 a] o=[a]- [2] e=[] mex=[2 a | 


Notice that @, Vv and W are the first, second and third columns of X, respectively. 
Now consider this list of matrix products: Au, AV, Aw and AX. 


az «Fe 4 7 
~ | 39 ~ 17 1 39 
So again note that the columns of X are U, V and w; that is, we can write 


X=|[G Vv wl]. 99 
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Notice also that the columns of AX are Au, AV and Aw, respectively. Thus we can write 


We summarize what we saw above in the following statement: 
The columns of a matrix product AX are A times the columns of X. 


How does this help us solve the matrix equation AX = B for X? Assume that A is a 
square matrix (that forces X and B to be the same size). We’ Il let x7, X2,-++Xp_ denote 
the columns of the (unknown) matrix X, and we'll let bi, Bao - bn denote the columns 
of B. We want to solve AX = B for X. That is, we want X where 


AX =B 
A(x xX) |= [by b> by, | 
[Axi Ax AG |= iy. BB by, | 


If the matrix on the left hand side is equal to the matrix on the right, then their 
respective columns must be equal. This means we need to solve n equations: 


Axi by 
AX> = b> 
A ‘n =Pn 


We already know how to do this; this is what we learned in the previous section. 
Let’s do this in a concrete example. In our above work we defined matrices A and X, 
and looked at the product AX. Let’s call the product B; that is, set B= AX. Now, let’s 
pretend that we don’t know what X is, and let’s try to find the matrix X that satisfies 
the equation AX = B. As a refresher, recall that 


1 2 
Since Ais a2 x 2 matrix and Bis a2 x 3 matrix, what dimensions must X be in the 
equation AX = B? The number of rows of X must match the number of columns of A; 


the number of columns of X must match the number of columns of B. Therefore we 
know that X must be a 2 x 3 matrix. 


2.5 Solving Matrix Equations AX = B 


We'll call the three columns of X x7, x. and x3. Our previous explanation tells us 
that if AX = B, then: 


AX = 8B 
ali a )=[7 7 5a] 
[ Axi AX5 aa] =| ; a 


Hence 


To find x;, we form the proper augmented matrix and put it into reduced row ech- 
elon form and interpret the results. 


eH i eae 
Bo eel EEE > Ih 


a= |4]. 


To find x3, we again form an augmented matrix and interpret its reduced row ech- 


elon form. 
1. Qed tek 10 -1 
Bs || PEP oa a 


gals 
aa fe 
which matches with what we already knew from above. 


Before continuing on in this manner to find x3, we should stop and think. If the 
matrix vector equation Ax = b is consistent, then the steps involved in putting 


This shows us that 


Thus 


[A 5] 


into reduced row echelon form depend only on A; it does not matter what bis. So 
when we put the two matrices 


ee ae 5 Tee 
Cee ee 2 Ce oe 
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from above into reduced row echelon form, we performed exactly the same steps! (In 
fact, those steps are: —3R, + Ry —> Ro; —}R2 > Rp; —2R. + Ri > Ry.) 

Instead of solving for each column of X separately, performing the same steps to 
put the necessary matrices into reduced row echelon form three different times, why 
don’t we just do it all at once??® Instead of individually putting 


1 2 3 121) oq [2 2 
3 4 7]? |3 4 1 3 4 39 


into reduced row echelon form, let’s just put 


1 2 3 1 #17 
3 4 7 1 39 


into reduced row echelon form. 


£2 3 a a7] oy [10 a 2S 
24.7 4 a9) “ 


By looking at the last three columns, we see X: 


1 -1 5 
x= [TT 6 


Now that we’ve justified the technique we’ve been using in this section to solve 
AX = B for X, we reinfornce its importance by restating it as a Key Idea. 


Key Idea 9 Solving AX = B 


Let A be ann x n matrix, where the reduced row echelon 
form of A is /. To solve the matrix equation AX = B for X, 


1. Form the augmented matrix [A 8B]. 


2. Put this matrix into reduced row echelon form. It 
will be of the form [/ X], where X appears in the 
columns where B once was. 


These simple steps cause us to ask certain questions. First, we specify above that A 
should be a square matrix. What happens if A isn’t square? Is a solution still possible? 
Secondly, we only considered cases where the reduced row echelon form of A was /| 
(and stated that as a requirement in our Key Idea). What if the reduced row echelon 
form of A isn’t /? Would we still be able to find a solution? (Instead of having exactly 
one solution, could we have no solution? Infinite solutions? How would we be able to 
tell?) 


16Qne reason to do it three different times is that we enjoy doing unnecessary work. Another reason 
could be that we are stupid. 


2.6 


The Matrix Inverse 


These questions are good to ask, and we leave it to the reader to discover their 
answers. Instead of tackling these questions, we instead tackle the problem of “Why 
do we care about solving AX = B?” The simple answer is that, for now, we only care 
about the special case when B = /. By solving AX = | for X, we find a matrix X that, 
when multiplied by A, gives the identity /. That will be very useful. 


Exercises 2.5 


In Exercises 1 —- 12, matrices A and B are 
given. Solve the matrix equation AX = B. 


.a=| 4 oh 
e=| 7 as | 
2 a=| 3, ek 
e=[ oy | 
sae a 
o=[% -2] 
aa=[ ik 
oi | 
ca=\ zl 
eal : | 
«a= | ah 


2.6 The Matrix Inverse 


AS YOU READ 


10. 


11. 


12. 


—2 -10 19 
13 022~«-2 
1 0 

3 ol va 
22 

3 i — 
—2 0 4 
-5 -4 5 |, 
3 5 -3 
=i - 2- ad 
—38 18 —13 
10 2 —18 
-5 -4 -1 
8 -—2 3), 
6 1 -8 
—21 -8 —-19 
65 —11 —10 
75 —51 33 
0 a2 4 

Oo 2 25 
1 2 23 

a3 3 2 

1 238 Sl 
— al 2 


1. T/F: lf A and B are square matrices where AB = |, then BA = 1. 


2. T/F: A matrix A has exactly one inverse, infinite inverses, or no inverse. 


3. T/F: Everyone is special. 
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4. T/F: If A is invertible, then Ax = O has exactly 1 solution. 
5. What is a corollary? 


6. Fillin the blanks: a matrix is invertible is useful; computing the inverse 
is 


Once again we visit the old algebra equation, ax = b. How do we solve for x? We 
know that, as long asa 4 0, 


b 
X = —, or, stated in another way, x = a ‘b. 
a 


What is a~+? It is the number that, when multiplied by a, returns 1. That is, 
a -a=1. 


Let us now think in terms of matrices. We have learned of the identity matrix / that 
“acts like the number 1.” That is, if A is a square matrix, then 


IA=AI=A. 


If we had a matrix, which we'llcallA~+, whereA~*A = /, then by analogy to our algebra 
example above it seems like we might be able to solve the linear system Ax = b for x 
by multiplying both sides of the equation by A~?. That is, perhaps 

X¥=A~‘b. 


Of course, there is a lot of speculation here. We don’t know that such a matrix like 
A+ exists. However, we do know how to solve the matrix equation AX = B, so we 
can use that technique to solve the equation AX = | for X. This seems like it will get us 
close to what we want. Let’s practice this once and then study our results. 


Example 53 Let 


Find a matrix X such that AX = /. 


SOLUTION We know how to solve this from the previous section: we form 
the proper augmented matrix, put it into reduced row echelon form and interpret the 
results. 


ae er ee 
‘4 6@ a jhe. a wa, @ 


We read from our matrix that 


104 


2.6 The Matrix Inverse 


Let’s check our work: 


Sure enough, it works. 


Looking at our previous example, we are tempted to jump in and call the matrix X 
that we found “A~?.” However, there are two obstacles in the way of us doing this. 

First, we know that in general AB 4 BA. So while we found that AX = /, we can’t 
automatically assume that XA = /. 

Secondly, we have seen examples of matrices where AB = AC, but B # C. So just 
because AX = |, it is possible that another matrix Y exists where AY = /. If this is the 
case, using the notation A~* would be misleading, since it could refer to more than 
one matrix. 

These obstacles that we face are not insurmountable. The first obstacle was that 
we know that AX = / but didn’t know that XA = /. That’s easy enough to check, 
though. Let’s look at A and X from our previous example. 


m=[2 SULT 


Perhaps this first obstacle isn’t much of an obstacle after all. Of course, we only 
have one example where it worked, so this doesn’t mean that it always works. We 
have good news, though: it always does work. The only “bad” news to come with this 
is that this is a bit harder to prove. We won’t worry about proving it always works, but 
state formally that it does in the following theorem. 


Theorem 5 


Special Commuting Matrix Products 


Let A be ann x n matrix. 
1. If there is a matrix X such that AX = /|,, then XA = I. 


2. If there is a matrix X such that XA = /,, then AX = Ip. 


The second obstacle is easier to address. We want to know if another matrix Y 


exists where AY = | = YA. Let’s suppose that it does. Consider the expression XAY. 
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Since matrix multiplication is associative, we can group this any way we choose. We 
could group this as (XA)Y; this results in 


We could also group XAY as X(AY). This tells us 


X(AY) = XI 
=X 


Combining the two ideas above, we see that X = XAY = Y; that is, X = Y. We 
conclude that there is only one matrix X where XA = | = AX. (Even if we think we 
have two, we can do the above exercise and see that we really just have one.) 

We have just proved the following theorem. 


Theorem 6 Uniqueness of Solutions to AX = /, 


Let A be ann x n matrix and let X be a matrix where AX = 


I,. Then X is unique; it is the only matrix that satisfies this 
equation. 


So given a square matrix A, if we can find a matrix X where AX = /, then we know 
that XA = | and that X is the only matrix that does this. This makes X special, so we 
give it a special name. 


Definition 18 Invertible Matrices and the Inverse of A 


Let A and X be n x n matrices where AX = | = XA. Then: 


1. Ais invertible. 


2. Xis the inverse of A, denoted by A~?. 


Let’s do an example. 


1 2 
Example 54 Find the inverse of A = | , | ‘ 
SOLUTION By solving the equation AX = | for X will give us the inverse of A. 


Forming the appropriate augmented matrix and finding its reduced row echelon form 
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gives us 


tos] [oot Ah 


a 4.04), US 1G @ a aap 


Yikes! We were expecting to find that the reduced row echelon form of this matrix 
would look like 


[1 Aut]. 


However, we don’t have the identity on the left hand side. Our conclusion: A is not 
invertible. 


We have just seen that not all matrices are invertible.1” With this thought in mind, 
let’s complete the array of boxes we started before the example. We’ve discovered 
that if a matrix has an inverse, it has only one. Therefore, we gave that special matrix 
aname, “the inverse.” Finally, we describe the most general way to find the inverse of 
a matrix, and a way to tell if it does not have one. 


Key Idea 10 Finding A~* 


Let A be ann x n matrix. To find A~?, put the augmented 
matrix 


[Ah] 


into reduced row echelon form. If the result is of the form 


[in x], 


then A? =X. If not, (that is, if the first n columns of the re- 
duced row echelon form are not /,,), then A is not invertible. 


Let’s try again. 


1 1 -1 
Example 55 Find the inverse, if it exists, of A= |;1 -—-1 1 
1 2 3 
SOLUTION We'll try to solve AX = / for X and see what happens. 
1 1 -1 100 = 100 05 0.5 ) 
1-1 1 0410 rref 010 02 -04 0.2 
1 2 3 001 001 -03 01 0.2 


Hence our previous definition; why bother calling A “invertible” if every square matrix is? If everyone 
is special, then no one is. Then again, everyone is special. 
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We have a solution, so 


0.5 0.5 0 
A=} 02 —04 0.2 
—0.3 01 0.2 


Multiply AA~? to verify that it is indeed the inverse of A. 

In general, given a matrix A, to find A~*+ we need to form the augmented matrix 
[A 1] and put it into reduced row echelon form and interpret the result. In the case 
of a 2 x 2 matrix, though, there is a shortcut. We give the shortcut in terms of a 
theorem."® 


Theorem 7 The Inverse of a 2x2 Matrix 


Let 


A is invertible if and only if ad — bc £ 0. 


If ad — bc £ O, then 


We can’t divide by 0, so if ad — bc = O, we don’t have an inverse. Recall Example 


54, where 
1 2 
ae. 


Here, ad — bc = 1(4) — 2(2) = 0, which is why A didn’t have an inverse. 
Although this idea is simple, we should practice it. 


Example 56 Use Theorem 7 to find the inverse of 


*=[4 | 


if it exists. 


18We don’t prove this theorem here, but it really isn’t hard to do. Put the matrix 
a b 1 0 
c d 0 1 


into reduced row echelon form and you’ll discover the result of the theorem. Alternatively, multiply A by 
what we propose is the inverse and see that we indeed get /. 


2.6 The Matrix Inverse 


SOLUTION Since ad — bc = 29 £0, A~' exists. By the Theorem, 
Ata 1 9 -2 
3(9)—2(-1) |1 3 
7 i 9 -2 
~ 29/1 3 


We can leave our answer in this form, or we could “simplify” it as 


ost s|elihs af | 


We started this section out by speculating that just as we solved algebraic equa- 
tions of the form ax = b by computing x = a~+b, we might be able to solve matrix 
equations of the form Ax = b by computing x = A~b. If A~1 does exist, then we can 
solve the equation Ax = b this way. Consider: 


Ax =b (original equation) 
A-1AxK = A~'b (multiply both sides on the left by A_*) 
iX = A-'b (since A~*A = 1) 
X=A b (since Ix = x) 


Let’s step back and think about this for a moment. The only thing we know about 
the equation Ax = b is that A is invertible. We also know that solutions to AX = b 
come in three forms: exactly one solution, infinite solutions, and no solution. We just 
showed that if A is invertible, then Ax = b has at least one solution. We showed that 
by setting x equal to A~b, we have a solution. Is it possible that more solutions exist? 

No. Suppose we are told that a known vector Vis a solution to the equation Ax = b; 
that is, we know that AV = b. We can repeat the above steps: 


AV =b 

A-tav = A~'b 
A-'b 
= 


> 


y= 


This shows that all solutions to Ax = b are exactly X = A~1b when Ais invertible. We 
have just proved the following theorem. 
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Theorem 8 Invertible Matrices and Solutions to Ax — b 


Let A be an invertible n x n matrix, and let b be anyn x 1 


column vector. Then the equation AX = b has exactly one 
solution, namely 


X¥=An‘b. 


A corollary’? to this theorem is: If A is not invertible, then Ax = b does not have 
exactly one solution. It may have infinite solutions and it may have no solution, and 
we would need to examine the reduced row echelon form of the augmented matrix 
[A b| to see which case applies. 

We demonstrate our theorem with an example. 


Example 57 Solve Ax = b by computing ¥ = A~‘b, where 
1 0 —3 —15 
A=]|-3 -—-4 10 and b= 57 
4 -5 -11 —46 
SOLUTION Without showing our steps, we compute 

94. 15 —12 
At=| 7 4 =1 
31 #55 —4 


We then find the solution to Ax = b by computing A~1b: 


X=A'b 
94 15 —12 —a5 
ee | 9 ed 57 
3165 «=A —46 
=3 
=| =2 
4 


We can easily check our answer: 


40 =3 ae —15 
a2. 8 -i9 —2 |=] 57 
A. =o at 4 —46 


19a corollary is an idea that follows directly from a theorem 
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Knowing a matrix is invertible is incredibly usefu 
you know A is invertible, then you know for sure that Ax = b has asolution (as we just 
stated in Theorem 8). In the next section we’ll demonstrate many different properties 
of invertible matrices, including stating several different ways in which we know that 


a matrix is invertible. 


Exercises 2.6 


|,20 


2.6 The Matrix Inverse 


Among many other reasons, if 


In Exercises 1-8, A matrix A is given. Find 
A} using Theorem 7, if it exists. 


cise 


In Exercises 9 — 28, a matrix A is given. Find 
A~* using Key Idea 10, if it exists. 


9. 


10. 


11. 


12. 


13. 


| —5 


1 0 


[ -2 


| 1 


[—5 


9 


5 


| 5/3 


25 
—18 
—6 


5 
=A 


ae 
A, 23 


| 


| 
| 


| 
—10 


7 
2 


-4 
3 
1 


14. 


15. 


18. 


19. 


21. 


22. 


23. 


i ee ce ee ee ee eee ee oe eee ee ee ee ek ee ee ek en ee ee or a eo oe 


oOoOrP FOO 


Oro 


0 


no 


—9 


0 


BRO OFROO 


1 


20As odd as it may sound, knowing a matrix is invertible is useful; actually computing the inverse isn’t. 


This is discussed at the end of the next section. 
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—15 45 —3 4 10 —57 - —14 
95 55 —164 15 —15 Beis 3 iA a. —4 
“1-215 640 -62 59 
4 12 0 1 2 eee 
33.A=]0 1 6], 
1 O 2 8 —-3 0 1 
26. Oo 1 0 0 : 47 
0 -—-4 ~-—29 —110 b=| —-5 
0 -3 —-5 —19 20 
0 0 1 0 1 0 3 
7 000 1 34.A=]8 -—2 —13], 
“|1 0 0 0 12 -—3 —20 
0 100 _34 
100 0 b= |} —159 
(0 2 8:8 ak 
“10 0 3 40 5 0 -2 
0 0 0 -4 wan |s * S12 
In Exercises 29 — 36, a matrix A and a vector —2 0 1 
b are given. Solve the equation Ax = b using 33 
Theorem 8. b= | —70 
—15 
3 5 ~ 21 
29. a=|3 a ce a 
1 4 34 36.A=]0 1 O|f, 
= im b= 2 -8 1 
a F “eh 2 ie 
—69 
9 70 ~ —2 b= 10 
aah, Ar b= | 1 —102 


2.7 Properties of the Matrix Inverse 


AS YOU READ 


1. What does it mean to say that two statements are “equivalent?” 
2. T/F: If A is not invertible, then Ax = 0 could have no solutions. 


T/F: If A is not invertible, then Ax = b could have infinite solutions. 


Poe 


What is the inverse of the inverse of A? 


5. T/F: Solving AX = b using Gaussian elimination is faster than using the inverse 
of A. 


We ended the previous section by stating that invertible matrices are important. 
Since they are, in this section we study invertible matrices in two ways. First, we look 
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at ways to tell whether or not a matrix is invertible, and second, we study properties 
of invertible matrices (that is, how they interact with other matrix operations). 

We start with collecting ways in which we know that a matrix is invertible. We 
actually already know the truth of this theorem from our work in the previous section, 
but it is good to list the following statements in one place. As we move through other 
sections, we'll add on to this theorem. 


Theorem 9 Invertible Matrix Theorem 


Let A be annxn matrix. The following statements are equiv- 
alent. 


(a) A is invertible. 
(b) There exists a matrix B such that BA = /. 
(c) There exists a matrix C such that AC = /. 


(d) The reduced row echelon form of A is /. 


(e) The equation Ax = b has exactly one solution for ev- 
ery n x 1 vector b. 


(f) The equation AX = 0 has exactly one solution 
(namely, X = 0). 


Let’s make note of a few things about the Invertible Matrix Theorem. 


1. First, note that the theorem uses the phrase “the following statements are equiv- 
alent.” When two or more statements are equivalent, it means that the truth of 
any one of them implies that the rest are also true; if any one of the statements 
is false, then they are all false. So, for example, if we determined that the equa- 
tion AX = Ohad exactly one solution (and A was ann Xx n matrix) then we would 
know that A was invertible, that Ax = bhad only one solution, that the reduced 
row echelon form of A was /, etc. 


2. Let’s go through each of the statements and see why we already knew they all 
said essentially the same thing. 


(a) This simply states that A is invertible — that is, that there exists a matrix 
A~* such that A~*A = AA~* = |. We'll go on to show why all the other 
statements basically tell us “A is invertible.” 


(b) If we know that A is invertible, then we already know that there is a matrix 
B where BA = /. That is part of the definition of invertible. However, we 
can also “go the other way.” Recall from Theorem 5 that even if all we know 
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is that there is a matrix B where BA = |, then we also know that AB = /. 
That is, we know that B is the inverse of A (and hence A is invertible). 


(c) We use the same logic as in the previous statement to show why this is the 
same as “A is invertible.” 


(d) If Ais invertible, we can find the inverse by using Key Idea 10 (which in turn 
depends on Theorem 5). The crux of Key Idea 10 is that the reduced row 
echelon form of A is /; if it is something else, we can’t find A~? (it doesn’t 
exist). Knowing that A is invertible means that the reduced row echelon 
form of A is /. We can go the other way; if we know that the reduced row 
echelon form of A is /, then we can employ Key Idea 10 to find At, so A is 
invertible. 


(e) We know from Theorem 8 that if A is invertible, then given any vector b, 
Ax = bhas always has exactly one solution, namely x = A~1b. However, 
we can go the other way; let’s say we know that Ax = b always has exactly 
solution. How can we conclude that A is invertible? 


Think about how we, up to this point, determined the solution to AX = 
b. We set up the augmented matrix [A b| and put it into reduced row 
echelon form. We know that getting the identity matrix on the left means 
that we had a unique solution (and not getting the identity means we either 
have no solution or infinite solutions). So getting / on the left means having 
a unique solution; having / on the left means that the reduced row echelon 
form of A is /, which we know from above is the same as A being invertible. 


(f) This is the same as the above; simply replace the vector b with the vector 
0. 


So we came up with a list of statements that are all equivalent to the statement 
“A is invertible.” Again, if we know that if any one of them is true (or false), then 
they are all true (or all false). 


Theorem 9 states formally that if A is invertible, then Ax = b has exactly one solu- 
tion, namely A~*b. What if A is not invertible? What are the possibilities for solutions 
to AX = b? 


We know that Ax = b cannot have exactly one solution; if it did, then by our the- 
orem it would be invertible. Recalling that linear equations have either one solution, 
infinite solutions, or no solution, we are left with the latter options when A is not in- 
vertible. This idea is important and so we'll state it again as a Key Idea. 
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Key Idea 11 Solutions to Ax = b and the Invertibility of A 


Consider the system of linear equations Ax = b. 


1. IfAis invertible, then Ax = b has exactly one solution, 
namely A~‘b. 


. If Ais not invertible, then Ax = b has either infinite 
solutions or no solution. 


In Theorem 9 we’ve come up with a list of ways in which we can tell whether or 
not a matrix is invertible. At the same time, we have come up with a list of properties 
of invertible matrices — things we know that are true about them. (For instance, if we 
know that A is invertible, then we know that Ax = b has only one solution.) 

We now go on to discover other properties of invertible matrices. Specifically, we 
want to find out how invertibility interacts with other matrix operations. For instance, 
if we know that A and Bare invertible, what is the inverse of A+ B? What is the inverse 
of AB? What is “the inverse of the inverse?” We'll explore these questions through an 
example. 


Example 58 Let 
a 2 2 0 
As E | nd6—| : 4 
Find: 
LA 3. (AB)~* 5. (A+B)? 
pa AA 2% 6. (5A)~} 


In addition, try to find connections between each of the above. 


SOLUTION 


1. Computing A~* is straightforward; we’ll use Theorem 7. 


eret[t 2]. [8 20) 


3/0 3 o 8061 


2. We compute B? in the same way as above. 
gee Dh Ola 2 
fay | i 2 1/2 4 
3. To compute (AB) ~*, we first compute AB: 
3 2 —2 0 —4 2 
=| alls Ale ‘| 
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We now apply Theorem 7 to find (AB)~?. 
aay" “]-[ae zal 
4. To compute (A~!)~?, we simply apply Theorem 7 to A?: 
wodule 28)-[6 2] 
5. To compute (A + B)~1, we first compute A + B then apply Theorem 7: 


eo (2 1-2 J 


(aseyt=F)?, Pal =! 


Hence 


Our last expression is really nonsense; we know that if ad — bc = 0, then the 
given matrix is not invertible. That is the case with A + B, so we conclude that 
A + Bis not invertible. 


6. To compute (5A)~?, we compute 5A and then apply Theorem 7. 
(sa) = (25 2° “* 4/5 10] [1/15 -2/15 
7 0 5 ~~ 75/0 15] | O 1/5 


We now look for connections between A~?, B~?, (AB)~1, (A~?)~1 and (A+ B)7?. 


3. Is there some sort of relationship between (AB)~+ and A~? and B~?? A first 
guess that seems plausible is (AB)~+ = A~1B-?. |s this true? Using our work 
from above, we have 


get = ke | lee | _ Es a | . 


0 1 2. 4 12. 4 


Obviously, this is not equal to (AB) ~+. Before we do some further guessing, let’s 
think about what the inverse of AB is supposed to do. The inverse — let’s call it 
C-—is supposed to be a matrix such that 


(AB)C = C(AB) = 1. 


|’ 


In examining the expression (AB)C, we see that we want B to somehow “cance 
with C. What “cancels” B? An obvious answer is B-?. This gives us a thought: 
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perhaps we got the order of A_ and B-? wrong before. After all, we were hop- 
ing to find that 


ABA~*B-+ = 1, 


but algebraically speaking, it is hard to cancel out these terms.2? However, 
switching the order of A~? and B~* gives us some hope. Is (AB)~t = B-*A~?? 
Let’s see. 


(AB)(B-*A~*) = A(BB~*)A~* (regrouping by the associative property) 


= AIA~* (BB-* = 1) 
= AA (Al = A) 
=] (AA + =1) 


Thus it seems that (AB)~? = B~1A~?. Let’s confirm this with our example ma- 


trices. 
mee TP P]-[at 3e]-a 
It worked! 


4. Is there some sort of connection between (A~*)~* and A? The answer is pretty 
obvious: they are equal. The “inverse of the inverse” returns one to the original 
matrix. 


5. Is there some sort of relationship between (A+ B)~1, A~* and B~? Certainly, if 
we were forced to make a guess without working any examples, we would guess 
that 


(A+B) 3=2a14 B74, 


However, we saw that in our example, the matrix (A + B) isn’t even invertible. 
This pretty much kills any hope of a connection. 


6. Is there a connection between (5A)~1 and A~*? Consider: 


apt [285 25) 
_1i ies | 
5| 0 1/5 
1 
= aaa 


Yes, there is a connection! 


Let’s summarize the results of this example. If A and B are both invertible matrices, 
then so is their product, AB. We demonstrated this with our example, and there is 
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more to be said. Let’s suppose that A and B aren x n matrices, but we don’t yet know 
if they are invertible. If AB is invertible, then each of A and B are; if AB is not invertible, 
then A or B is also not invertible. 


In short, invertibility “works well” with matrix multiplication. However, we saw 
that it doesn’t work well with matrix addition. Knowing that A and B are invertible 
does not help us find the inverse of (A + 8); in fact, the latter matrix may not even be 
invertible. 


Let’s do one more example, then we’ll summarize the results of this section in a 
theorem. 


2 0 O 
Example 59 FindtheinverseofA=|0 3 O 
0 0 -—7 
SOLUTION We'll find A~* using Key Idea 10. 
20 0 100 100 1/2 O 0 
-- > 
03 0 010 ref |}0 10 0 1/3 #O 
00-7001 001 0 0 —1/7 
Therefore 
1/2 0 0 
A=) 0 47s oO 


The matrix A in the previous example is a diagonal matrix: the only nonzero entries 
of A lie on the diagonal.”? The relationship between A and A? in the above example 
seems pretty strong, and it holds true in general. We'll state this and summarize the 
results of this section with the following theorem. 


22The fact that invertibility works well with matrix multiplication should not come as a surprise. After 
all, saying that A is invertible makes a statement about the mulitiplicative properties of A. It says that | can 
multiply A with a special matrix to get /. Invertibility, in and of itself, says nothing about matrix addition, 
therefore we should not be too surprised that it doesn’t work well with it. 

23We still haven’t formally defined diagonal, but the definition is rather visual so we risk it. See Definition 
20 on page 123 for more details. 
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Theorem 10 Properties of Invertible Matrices 
Let A and B ben x ninvertible matrices. Then: 
1. AB is invertible; (AB)~? = B-1A~?. 


2. A’ is invertible; (A~1)-1 = A. 


3. nA is invertible for any nonzero scalar n; (nA)~? = 
1,-1 
an 


. If A is a diagonal matrix, with diagonal entries 
di,d2,--- ,d,, where none of the diagonal en- 
tries are 0, then A~! exists and is a diagonal ma- 
trix. Furthermore, the diagonal entries of A-' are 
1/d1,1/dz,--+ ,1/dh. 


Furthermore, 


1. If a product AB is not invertible, then A or B is not in- 
vertible. 


2. If A or B are not invertible, then AB is not invertible. 


We end this section with a comment about solving systems of equations “in real 
life’24 Solving a system AX = b by computing A~1b seems pretty slick, so it would 
make sense that this is the way it is normally done. However, in practice, this is rarely 
done. There are two main reasons why this is the case. 

First, computing A~? and A~“b is “expensive” in the sense that it takes up a lot of 
computing time. Certainly, our calculators have no trouble dealing with the 3 x 3 cases 
we often consider in this textbook, but in real life the matrices being considered are 
very large (as in, hundreds of thousand rows and columns). Computing A~* alone is 
rather impractical, and we waste a lot of time if we come to find out that A~? does 
not exist. Even if we already know what A~? is, computing A~"b is computationally 
expensive — Gaussian elimination is faster. 

Secondly, computing A~* using the method we’ve described often gives rise to 
numerical roundoff errors. Even though computers often do computations with an 
accuracy to more than 8 decimal places, after thousands of computations, roundoffs 


24Ves, real people do solve linear equations in real life. Not just mathematicians, but economists, engi- 
neers, and scientists of all flavors regularly need to solve linear equations, and the matrices they use are 
often huge. 

Most people see matrices at work without thinking about it. Digital pictures are simply “rectangular 
arrays” of numbers representing colors — they are matrices of colors. Many of the standard image processing 
operations involve matrix operations. The author’s wife has a “7 megapixel” camera which creates pictures 
that are 3072 x 2304 in size, giving over 7 million pixels, and that isn’t even considered a “large” picture 
these days. 
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can cause big errors. (A “small” 1,000 x 1,000 matrix has 1, 000, 000 entries! That’s 
a lot of places to have roundoff errors accumulate!) It is not unheard of to have a 
computer compute A~* for a large matrix, and then immediately have it compute AA * 


and not get the identity matrix.”° 


Therefore, in real life, solutions to AX = bare usually found using the methods we 
learned in Section 2.4. It turns out that even with all of our advances in mathematics, 
it is hard to beat the basic method that Gauss introduced a long time ago. 


Exercises 2.7 


In Exercises 1-4, matrices A and B are given. 


Compute (AB)~* and B-'A~?. 


1,.A= : + B= t 
Baal ee) 
noe [E fh 
4,.A= | ; al B= 
In Exercises 5 -8 a : 
Compute A~’ and (A~*)~* using Theorem 7. 
5.a= - | 
ve) 
ree 


NO Fer NN N Ww 


6 


2 x 2 matrix A is given. 


[33 


. Find2x2 matrices A and B that are each 


invertible, but A + B is not. 


. Create a random 6 x 6 matrix A, then 


have a calculator or computer compute 
AA". Was the identity matrix returned 
exactly? Comment on your results. 


. Use a calculator or computer to com- 


pute AA~', where 


1 2 3 4 
ees 1 4 9 16 

1 8 27 64 

1 16 81 256 


Was the identity matrix returned ex- 
actly? Comment on your results. 


25The result is usually very close, with the numbers on the diagonal close to 1 and the other entries near 


0. But it isn’t exactly the identity matrix. 


OPERATIONS ON MATRICES 


In the previous chapter we learned about matrix arithmetic: adding, subtracting, and 
multiplying matrices, finding inverses, and multiplying by scalars. In this chapter we 
learn about some operations that we perform on matrices. We can think of them as 
functions: you input a matrix, and you get something back. One of these operations, 
the transpose, will return another matrix. With the other operations, the trace and 
the determinant, we input matrices and get numbers in return, an idea that is different 
than what we have seen before. 


3.1 The Matrix Transpose 
1. T/F: If Aisa3 x 5 matrix, then A’ willbe a5 x 3 matrix. 
2. Where are there zeros in an upper triangular matrix? 
3. T/F: A matrix is symmetric if it doesn’t change when you take its transpose. 
4. What is the transpose of the transpose of A? 


5. Give 2 other terms to describe symmetric matrices besides “interesting.” 


We jump right in with a definition. 
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Definition 19 Transpose 


Let A be anm x n matrix. The tranpsose of A, denoted A’, 
is the n x m matrix whose columns are the respective rows 
of A. 


Examples will make this definition clear. 


Example 60 Find the transpose of A = F : | : 
SOLUTION Note that A is a 2 x 3 matrix, so A’ will be a3 x 2 matrix. By the 


definition, the first column of A’ is the first row of A; the second column of A’ is the 
second row of A. Therefore, 


1 4 
Aa 19. Ss 
3 6 
Example 61 Find the transpose of the following matrices. 
7 2 9 1 1 10 -2 
A=|/2 -1 3 0] B=/3 -5 7 c=[1 -1 7 8 3] 
5 3 0 11 4 2 -3 
SOLUTION We find each transpose using the definition without explanation. 


Make note of the dimensions of the original matrix and the dimensions of its transpose. 


1 
7 2 <5 

2 ioe & . 1 3 44 _ ia 

A=15 3 9 B’=|10 -5 2 c=|7 

<p 7 =3 8 

i -O. 4 


Notice that with matrix B, when we took the transpose, the diagonal did not change. 
We can see what the diagonal is below where we rewrite B and B’ with the diagonal in 
bold. We’ll follow this by a definition of what we mean by “the diagonal of a matrix,” 
along with a few other related definitions. 


1 10 —2 1 3 4 
B=/3 5 7 B’=/10 -5 2 
4 2 -3 2 7 -3 


It is probably pretty clear why we call those entries “the diagonal.” Here is the 
formal definition. 


3.1 The Matrix Transpose 


Definition 20 The Diagonal, a Diagonal Matrix, Triangular Matrices 


Let A be an m x n matrix. The diagonal of A consists of the 
entries Q11, Q22,... Of A. 


A diagonal matrix is an n x n matrix in which the only 
nonzero entries lie on the diagonal. 


An upper (lower) triangular matrix is a matrix in which any 
nonzero entries lie on or above (below) the diagonal. 


Example 62 Consider the matrices A, B, C and /,, as well as their transposes, 
where 
1 2 3 3 0 O ; ; : 
A=|0 4 5 B=|]0 7 0O C= 00 6 
0 0 6 0 0 1 —. 


Identify the diagonal of each matrix, and state whether each matrix is diagonal, upper 
triangular, lower triangular, or none of the above. 


SOLUTION We first compute the transpose of each matrix. 
100 3 0 O 100 0 
At=|2 4 0| BF =10 7 O C=|2 4.0 0 
3 5 6 0 0 -1 3 5 6 0 


Note that /, = Ia. 

The diagonals of A and A’ are the same, consisting of the entries 1, 4 and 6. The 
diagonals of B and B’ are also the same, consisting of the entries 3, 7 and —1. Finally, 
the diagonals of Cand C’ are the same, consisting of the entries 1, 4 and 6. 

The matrix A is upper triangular; the only nonzero entries lie on or above the diag- 
onal. Likewise, A’ is lower triangular. 

The matrix B is diagonal. By their definitions, we can also see that B is both upper 
and lower triangular. Likewise, /, is diagonal, as well as upper and lower triangular. 

Finally, Cis upper triangular, with C’ being lower triangular. 


Make note of the definitions of diagonal and triangular matrices. We specify that 
a diagonal matrix must be square, but triangular matrices don’t have to be. (“Most” 
of the time, however, the ones we study are.) Also, as we mentioned before in the 
example, by definition a diagonal matrix is also both upper and lower triangular. Fi- 
nally, notice that by definition, the transpose of an upper triangular matrix is a lower 
triangular matrix, and vice-versa. 
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There are many questions to probe concerning the transpose operations.? The 
first set of questions we’ll investigate involve the matrix arithmetic we learned from 
last chapter. We do this investigation by way of examples, and then summarize what 
we have learned at the end. 


Example 63 Let 


Find A’ + B’ and (A+ B)’. 


SOLUTION We note that 
1 4 1 3 
A’=|]2 5] andB’=|2 —-1 
3 6 1 O 
Therefore 
1 4 1 3 
A’+B'=|/2 5|/+1]2 -1 
3 6 1 O 
2 7 
=14 4 
4 6 
Also, 
i'2 3 i 2 aly" 
Peas 
ere) ar 5 s|+|3 -1 a 
fle & aly 
_ 7 4 6 
2 7 
=14 4]. 
4 6 


It looks like “the sum of the transposes is the transpose of the sum.”? This should 
lead us to wonder how the transpose works with multiplication. 


Example 64 Let 


1 -2 1 2 -1 
A= and B = : 
3 4 1 0 1 
1Remember, this is what mathematicians do. We learn something new, and then we ask lots of questions 
about it. Often the first questions we ask are along the lines of “How does this new thing relate to the old 
things | already know about?” 


124 2This is kind of fun to say, especially when said fast. Regardless of how fast we say it, we should think 
about this statement. The “is” represents “equals.” The stuff before “is” equals the stuff afterwards. 


3.1 The Matrix Transpose 


Find (AB)', A’B’ and BTA’. 


SOLUTION We first note that 
3 1 1 
al, :| andB’=| 2 O 
—1 1 
Find (AB)’: 
7 
r_(f1 2]f1 2 -1 
(aay = ([3 4|}|1 0 1 
oe ee 
> 7 6 1 
3 7 
=|2 6 
11 


Now find A’B’: 


= Not defined! 


So we can’t compute A’B’. Let’s finish by computing BA’: 


We may have suspected that (AB)’ = A’B’. We saw that this wasn’t the case, 
though — and not only was it not equal, the second product wasn’t even defined! Oddly 
enough, though, we saw that (AB)’ = B’A’. ? To help understand why this is true, 
look back at the work above and confirm the steps of each multiplication. 

We have one more arithmetic operation to look at: the inverse. 


Example 65 Let 


3Then again, maybe this isn’t all that “odd.” It is reminiscent of the fact that, when invertible, (AB)~1 = 
B-14-1, 
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Find (A~*)! and (A‘)~?. 
SOLUTION We first find A~* and A’: 


1 [4 -7 7_[2 1 
A =|" 7 | anaat = | 3 al: 


Finding (A~*)?: 


Finding (A’)~?: 


It seems that “the inverse of the transpose is the transpose of the inverse.”4 


We have just looked at some examples of how the transpose operation interacts 
with matrix arithmetic operations.” We now give a theorem that tells us that what we 
saw wasn’t a coincidence, but rather is always true. 


Theorem 11 Properties of the Matrix Transpose 


Let A and B be matrices where the following operations are 
defined. Then: 
1. (A+B)’ =A’ +B" and (A—B)' =A! — 


ae Sa 


= (At)? 


( 
- ( 
- ( 
GS 
- ( 


We included in the theorem two ideas we didn’t discuss already. First, that (kKA)™ = 


4 Again, we should think about this statement. The part before “is” states that we take the transpose of 
a matrix, then find the inverse. The part after “is” states that we find the inverse of the matrix, then take 
the transpose. Since these two statements are linked by an “is,” they are equal. 

5These examples don’t prove anything, other than it worked in specific examples. 
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kA’. This is probably obvious. It doesn’t matter when you multiply a matrix by a scalar 
when dealing with transposes. 

The second “new” item is that (A’)’ = A. That is, if we take the transpose of a 
matrix, then take its transpose again, what do we have? The original matrix. 

Now that we know some properties of the transpose operation, we are tempted 
to play around with it and see what happens. For instance, if A is anm x n matrix, we 
know that A’ is ann x m matrix. So no matter what matrix A we start with, we can 
always perform the multiplication AA’ (and also A"A) and the result is a square matrix! 

Another thing to ask ourselves as we “play around” with the transpose: suppose A 
is a square matrix. Is there anything special about A + A’? The following example has 
us try out these ideas. 


Example 66 Let 
2 1 3 
A=|]2 -1 1 
101 
Find AA’, A+ A’ andA—A’. 
SOLUTION Finding AA’: 
2 1 3 2 2 1 
AA'T=|2 -1 1 1-1 0 
1 0 1//3 1 1 
146 5 
=|6 4 3 
5 3 2 
Finding A +A’: 
2 1 3 2 2 1 
A+A’=|]2 -1 1] +]1 -1 0 
1 0 1 3 1 1 
2 3 4 
=|/|3 —2 1 
4 12 
Finding A — A’: 
2 1 3 2 2 1 
A-A'T=]2 -1 1 1 -1 0 
1 0 1 3 1 1 
0 -1 2 
=/1 01 
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Let’s look at the matrices we’ve formed in this example. First, consider AA’. Some- 
thing seems to be nice about this matrix — look at the location of the 6’s, the 5’s and 
the 3’s. More precisely, let’s look at the transpose of AA’. We should notice that if we 
take the transpose of this matrix, we have the very same matrix. That is, 


T 


We'll formally define this in a moment, but a matrix that is equal to its transpose 
is called symmetric. 

Look at the next part of the example; what do we notice about A + A’? We should 
see that it, too, is symmetric. Finally, consider the last part of the example: do we 
notice anything about A — A’? 

We should immediately notice that it is not symmetric, although it does seem 
“close.” Instead of it being equal to its transpose, we notice that this matrix is the 
opposite of its transpose. We call this type of matrix skew symmetric.© We formally 
define these matrices here. 


Definition 21 Symmetric and Skew Symmetric Matrices 


A matrix A is symmetric if AT = A. 


A matrix A is skew symmetric if A’ = —A. 


Note that in order for a matrix to be either symmetric or skew symmetric, it must 
be square. 


So why was AA! symmetric in our previous example? Did we just luck out?” Let’s 
take the transpose of AA’ and see what happens. 


(AA’)" => (A‘)T(A)" transpose multiplication rule 


= AAT ayr=a 


We have just proved that no matter what matrix A we start with, the matrix AA’ will 
be symmetric. Nothing in our string of equalities even demanded that A be a square 
matrix; it is always true. 

We can do a similar proof to show that as long as A is square, A+A’ is asymmetric 
matrix.2 We’ll instead show here that if A is a square matrix, then A — A’ is skew 


&Some mathematicians use the term antisymmetric 
7Of course not. 
8Why do we say that A has to be square? 
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symmetric. 


(A —= A’)! = Ar = (A‘)? transpose subtraction rule 


So we took the transpose of A — A’ and we got —(A — A’); this is the definition of 
being skew symmetric. 

We’ll take what we learned from Example 66 and put it in a box. (We’ve already 
proved most of this is true; the rest we leave to solve in the Exercises.) 


Theorem 12 Symmetric and Skew Symmetric Matrices 


. Given any matrix A, the matrices AA’ and A’A are sym- 
metric. 


. Let A be a square matrix. The matrix A + A’ is sym- 
metric. 


. Let A be a square matrix. The matrix A — A’ is skew 
symmetric. 


Why do we care about the transpose of a matrix? Why do we care about symmetric 
matrices? 

There are two answers that each answer both of these questions. First, we are 
interested in the tranpose of a matrix and symmetric matrices because they are in- 
teresting.’ One particularly interesting thing about symmetric and skew symmetric 
matrices is this: consider the sum of (A + A’) and (A — A’): 


(A +A‘) + (A—A') =2A. 


This gives us an idea: if we were to multiply both sides of this equation by ;, then the 
right hand side would just be A. This means that 


A= 5A +A‘) + aC —A’). 
—+_-_—4 —+_—-_——4” 
symmetric skew symmetric 
That is, any matrix A can be written as the sum of a symmetric and skew symmetric 
matrix. That’s interesting. 
The second reason we care about them is that they are very useful and important in 
various areas of mathematics. The transpose of a matrix turns out to be an important 


90r: “neat,” “cool,” “bad,” “wicked,” “phat,” “fo-shizzle.” 
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operation; symmetric matrices have many nice properties that make solving certain 
types of problems possible. 

Most of this text focuses on the preliminaries of matrix algebra, and the actual uses 
are beyond our current scope. One easy to describe example is curve fitting. Suppose 
we are given a large set of data points that, when plotted, look roughly quadratic. How 
do we find the quadratic that “best fits” this data? The solution can be found using 
matrix algebra, and specifically a matrix called the pseudoinverse. If A is a matrix, the 
pseudoinverse of A is the matrix A‘ = (ATA)~1A7 (assuming that the inverse exists). 
We aren’t going to worry about what all the above means; just notice that it has a cool 
sounding name and the transpose appears twice. 

In the next section we’ll learn about the trace, another operation that can be per- 
formed on a matrix that is relatively simple to compute but can lead to some deep 
results. 


Exercises 3.1 


In Exercises 1-24, a matrix A is given. Find 4 -5 2 
A’; make note if A is upper/lower triangular, 2. );1 5 9 
diagonal, symmetric and/or skew symmetric. 9 2 3 
‘ [ —7 4 4 0 -2 
“| 4 —6 13. 0 2 3 
ae —2 3 6 
7 lems :| O. Bo. 2 
rar, 14. | 3 -4 1 
3 Lo | —2 1 0 
a le = 15 E a zs 
—-3 1 i 
; 7 -4 —10 
-5 -9 
ae oe 0 6 it 
>, _8 16. | 6 0 64 
—-1 -4 0 
—2 10 
6. | 1 —7 a 
| 9 -2 17. 5 -4 —-10 
Z —6 6 9 
axlices “a | 
“1-9 6 3 -9 4 0 O 
: 19) b=. —F 20 
3 | 2. —10 0 ‘| 4 25 
—10 -2 -3 1 
—-3 -4 -5 
9. [-7 -8 2 -3] 19. E -3 5 
10..[-9 8 2 -7] 0 oO -3 
-9 4 10 6 -7 2 6 
11. 6 -3 —-7 20. |0 -8 -1 0 
-8 1 -1 0 0 1 —-7 
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1 0 O 0 1 —2 
21. |0 2 O 23. |—-1 O 4 
0 0 -1 2 -4 0O 
6 —-4 —-5 0 0 O 
22. |—-4 0 2 24. |0 0 0O 
-5 2 -2 0 0 O 


3.2 The Matrix Trace 


AS YOU READ 


1. T/F: We only compute the trace of square matrices. 


2. T/F: One can tell if a matrix is invertible by computing the trace. 


Inthe previous section, we learned about an operation we can peform on matrices, 
namely the transpose. Given a matrix A, we can “find the transpose of A,” which is 
another matrix. In this section we learn about a new operation called the trace. It is 
a different type of operation than the transpose. Given a matrix A, we can “find the 
trace of A,” which is not a matrix but rather a number. We formally define it here. 


Definition 22 The Trace 


Let A be ann x n matrix. The trace of A, denoted tr(A), is 
the sum of the diagonal elements of A. That is, 


tr(A) = aay + ao2 +--+ + Onn. 


This seems like a simple definition, and it really is. Just to make sure it is clear, let’s 
practice. 


Example 67 Find the trace of A, B, C and /4, where 
1 2 O 
a=|3 |. 8= 3 8 #1 and c= | : eI 
—2 7 -5 
SOLUTION To find the trace of A, note that the diagonal elements of A are 1 


and 4. Therefore, tr(A) =1+4=5. 
We see that the diagonal elements of B are 1, 8 and-5, sotr(B)=14+8—5=4. 
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The matrix C is not a square matrix, and our definition states that we must start 
with a square matrix. Therefore tr(C) is not defined. 


Finally, the diagonal of I, consists of four 1s. Therefore tr(/q) = 4. 


Now that we have defined the trace of a matrix, we should think like mathemati- 
cians and ask some questions. The first questions that should pop into our minds 
should be along the lines of “How does the trace work with other matrix operations?” © 
We should think about how the trace works with matrix addition, scalar multiplication, 
matrix multiplication, matrix inverses, and the transpose. 


We’ll give a theorem that will formally tell us what is true in a moment, but first 
let’s play with two sample matrices and see if we can see what will happen. Let 


2 1 3 2 0 1 
A=|2 0 -1] andB=]|]-1 2 O 
3 -1 3 0 2 -1 


2 1 3 2 0 1 
tr(A + B) =tr 2 0 -1/;+);]-1 2 O 
3 -1 3 Oo 2 -1 
4 1 4 
=tr 1 2 -1 
3 1 2 
=8 


So we notice that tr(A + B) = tr(A) + tr(B). This probably isn’t a coincidence. 


How does the trace work with scalar multiplication? If we multiply A by 4, then the 
diagonal elements will be 8, 0 and 12, so tr(4A) = 20. Is it a coincidence that this is 4 
times the trace of A? 


Let’s move on to matrix multiplication. How will the trace of AB relate to the traces 
of A and B? Let’s see: 


2 #1 3 2 0 1 
tr(AB)=tr| |2 0 —-1 -1 2 0 
3 -1 3 0 2 -1 
3 8 -1 
=tr 4 -2 3 
7 4 0 
= 1 


10Recall that we asked a similar question once we learned about the transpose. 
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It isn’t exactly clear what the relationship is among tr(A), tr(B) and tr(AB). Before 
moving on, let’s find tr(BA): 


2 0 1 2 1 3 
tr(BA)=tr| |—-1 2 0 2 0 -1 
0 2 -1 3 -1 3 

7 1 9 

=tr 2 -1 —-5 

1 #1 -5 


We notice that tr(AB) = tr(BA). Is this coincidental? 
How are the traces of A and A~* related? We compute A~? and find that 


1/17 6/17. 1/17 
A-t=|9/17. 3/17. —8/17 
2/17 =—5/17 2/17 
Therefore tr(A~ +) = 6/17. Again, the relationship isn’t clear.1" 

Finally, let’s see how the trace is related to the transpose. We actually don’t have 
to formally compute anything. Recall from the previous section that the diagonals 
of A and A’ are identical; therefore, tr(A) = tr(A’). That, we know for sure, isn’t a 
coincidence. 

We now formally state what equalities are true when considering the interaction 
of the trace with other matrix operations. 


Theorem 13 Properties of the Matrix Trace 


Let A and B ben x n matrices. Then: 
. tr(A + B) = tr(A) + tr(B) 
. tr(A — B) = tr(A) — tr(B) 
. tr(kA) = k-tr(A) 


. tr(AB) = tr(BA) 


. tr(A’) = tr(A) 


One of the key things to note here is what this theorem does not say. It says nothing 
about how the trace relates to inverses. The reason for the silence in these areas is 
that there simply is not a relationship. 


11Something to think about: we know that not all square matrices are invertible. Would we be able to 
tell just by the trace? That seems unlikely. 
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We end this section by again wondering why anyone would care about the trace of 
matrix. One reason mathematicians are interested in it is that it can give a measure- 
ment of the “size”?? of a matrix. 

Consider the following 2 x 2 matrices: 


1 -2 6 7 
A= Ei 1 and B = e EE 
These matrices have the same trace, yet B clearly has bigger elements in it. So how 
can we use the trace to determine a “size” of these matrices? We can consider tr(A’A) 


and tr(B’B). 
er(ata) =er([ 4, ake eal 


al )) 


ej tee wee 
x —2 65 
= 222 
Our concern is not how to interpret what this “size” measurement means, but 


rather to demonstrate that the trace (along with the transpose) can be used to give 
(perhaps useful) information about a matrix. 


12There are many different measurements of a matrix size. In this text, we just refer to its dimensions. 
Some measurements of size refer the magnitude of the elements in the matrix. The next section describes 
yet another measurement of matrix size. 
13This example brings to light many interesting ideas that we'll flesh out just a little bit here. 
1. Notice that the elements of A are 1, —2, 1 and 1. Add the squares of these numbers: 1? + (—2)? + 
41? =7=tr(AlA). 
Notice that the elements of B are 6, 7, 11 and -4. Add the squares of these numbers: 62 + 72 + 
112 + (—4)? = 222 = tr(B’B). 
Can you see why this is true? When looking at multiplying A’A, focus only on where the elements 
on the diagonal come from since they are the only ones that matter when taking the trace. 


2. You can confirm on your own that regardless of the dimensions of A, tr(A’A) = tr(AA’). To see why 
this is true, consider the previous point. (Recall also that A’A and AA’ are always square, regardless 
of the dimensions of A.) 

3. Mathematicians are actually more interested in \/tr(A’A) than just tr(A’A). The reason for this is a 
bit complicated; the short answer is that “it works better.” The reason “it works better” is related to 
the Pythagorean Theorem, all of all things. If we know that the legs of a right triangle have length 
a and b, we are more interested in \/ a2 + b* than just a2 + b2. Of course, this explanation raises 
more questions than it answers; our goal here is just to whet your appetite and get you to do some 
more reading. A Numerical Linear Algebra book would be a good place to start. 


3.3. The Determinant 


Exercises 3.2 


In Exercises 1-15, find the trace of the given 5 2 2 2 
matrix. 7 4 —-7 —3 
an ee ae oe 
1 1 -5 
slg -« -4 8 -8 -2 
5 (= 13. In 
‘|-6 4 
14. |, 
3 7 5 
“|-5 —4 15. A matrix A that is skew symmetric. 
—6 0O : , 
4. ae | In Exercises 16 — 19, verify Theorem 13 by: 
4 1 1 1. Showing that tr(A)+tr(B) = tr(A + B) 
5. |-2 0 O and 
—1 -2 -5 2. Showing that tr(AB) = tr(BA). 
0 -—3 1 
Ge) ae 8 16. 4=| ee ee | 
ae a. a 9 -6 6 3 
—2 -3 5 0 -8 4 55 
she = va=[? Ff]. e=[22 5] 
—1 -3 1 
4 2 4 —8 -10 10 
g. | —4 1 18. A= 10 5 —6 
0 -5 5 -10 1 3 
9 2 6 4 —10 -4 -3 
a ee | ae Se 
3 7 3 
6 5 
10. | 2 10 —10 7 5 
3 3 19. A= 7 7 —-5 
16 6 <7 =9 er = 
11 —2 1 6 -—9 —-3 -4 9 
“| 0 4 -4 0 B=|4 -1 -9 
3 -9 3 -10 -7 -8 10 


3.3. The Determinant 


AS YOU READ 


1. T/F: The determinant of a matrix is always positive. 


2. T/F: To compute the determinant of a 3 x 3 matrix, one needs to compute the 
determinants of 3 2 x 2 matrices. 


3. Give an example of a2 x 2 matrix with a determinant of 3. 


135 


Chapter 3 Operations on Matrices 


136 


In this chapter so far we’ve learned about the transpose (an operation on a ma- 
trix that returns another matrix) and the trace (an operation on a square matrix that 
returns a number). In this section we’ll learn another operation on square matrices 
that returns a number, called the determinant. We give a pseudo-definition of the 
determinant here. 


The determinant of ann x n matrix A is a number, denoted 
det(A), that is determined by A. 


That definition isn’t meant to explain everything; it just gets us started by making 
us realize that the determinant is a number. The determinant is kind of a tricky thing 
to define. Once you know and understand it, it isn’t that hard, but getting started is a 
bit complicated.14 We start simply; we define the determinant for 2 x 2 matrices. 


Definition 23 Determinant of 2 x 2 Matrices 


A=|? ale 


Let 


The determinant of A, denoted by 


a 
det (A) or : 


b 
d 2 


We’ve seen the expression ad — bc before. In Section 2.6, we saw that a2 x 2 matrix 
A has inverse 
1 d -—b 
ad—bc|—-c a 


as long as ad — bc 4 0; otherwise, the inverse does not exist. We can rephrase the 
above statement now: If det(A)4 0, then 


“=a [“e o | 


A brief word about the notation: notice that we can refer to the determinant by us- 
ing what looks like absolute value bars around the entries of a matrix. We discussed at 
the end of the last section the idea of measuring the “size” of a matrix, and mentioned 
that there are many different ways to measure size. The determinant is one such way. 
Just as the absolute value of a number measures its size (and ignores its sign), the de- 
terminant of a matrix is a measurement of the size of the matrix. (Be careful, though: 
det(A) can be negative!) 

Let’s practice. 


M4 It’s similar to learning to ride a bike. The riding itself isn’t hard, it is getting started that’s difficult. 


3.3. The Determinant 


Example 68 Find the determinant of A, B and C where 
1 2 3 -1 1 -3 
=e 4 |e8= | 7 | ande= | 7, aap 
SOLUTION Finding the determinant of A: 


det (A) = | 
= 1(4) - 2(3) 
=-2. 


Similar computations show that det (B) = 3(7) — (—1)(2) = 23 and det (C) = 
1(6) — (—3)(—2) =0. 


Finding the determinant of a 2 x 2 matrix is pretty straightforward. It is natural to 
ask next “How do we compute the determinant of matrices that are not 2 x 22” We 
first need to define some terms. 

Definition 24 Matrix Minor, Cofactor 
Let A be ann x n matrix. The i, j minor of A, denoted A;;, is 
the determinant of the (n — 1) x (n — 1) matrix formed by 


deleting the i” row and j” column of A. 


The j, j-cofactor of A is the number 


Cy = (1) Ai. 


Notice that this definition makes reference to taking the determinant of a matrix, while 
we haven't yet defined what the determinant is beyond 2 x 2 matrices. We recognize 
this problem, and we’ll see how far we can go before it becomes an issue. 

Examples will help. 


Example 69 Let 
1 2 3 : : ; : 
A=|4 5 6] andB= 
739 -1 9 -4 6 
1 1 11 


Find A1.3, A3,2, Bz,1, Ba,3 and their respective cofactors. 


15This is the standard definition of these two terms, although slight variations exist. 
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SOLUTION To compute the minor A; 3, we remove the first row and third col- 
umn of A then take the determinant. 
12 3 42 3 45 
a=|45 6/s]4a 5 6/>[% §| 
7 8 9 7 8 9 
4 5 
Aaa] 5 |= 22 35=-3 


The corresponding cofactor, C; 3, is 
C13 = (—1)'47A1,3 = (—1)*(-3) = 3. 


The minor A3 2 is found by removing the third row and second column of A then 
taking the determinant. 


The corresponding cofactor, C3 2, is 


Ca.9 = (-1)3+7Ag 9 = (—1)9(-6) = 6. 


The minor Bz, is found by removing the second row and first column of B then 
taking the determinant. 


L 2 Oo 8 +2 0 8 
e-/-3 5 7 2},]/3 5 7 af ]§ 2 E 
-1 9 -4 6 49 -4 6 ; 4-4 
. 42.2 4 #111 
2 0 8|, 
Bi=|9 -4 6|=? 
La 4 


We’re a bit stuck. We don’t know how to find the determinate of this 3 x 3 matrix. 
We'll come back to this later. The corresponding cofactor is 


Co. = (—1)?**B21 = —B2,1, 


whatever this number happens to be. 
The minor Bg.3 is found by removing the fourth row and third column of B then 
taking the determinant. 


1 2 0 8 1 2 0 8 
e-(73 5 7 2/,]/-3 5 % 2;_,] 4, € 5 
~ |-1 9 -4 6 -1 9 4 6 Soo: 
1 abe a 444 4 
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Again, we’re stuck. We won't be able to fully compute Cy 3; all we know so far is that 
Ca3 = (—1)417By.3 = (—1)Ba,3. 


Once we learn how to compute determinates for matrices larger than 2 x 2 we can 
come back and finish this exercise. 


In our previous example we ran into a bit of trouble. By our definition, in order 
to compute a minor of ann x n matrix we needed to compute the determinant of a 
(n — 1) x (n—1) matrix. This was fine when we started with a 3 x 3 matrix, but when 
we got up to a 4 x 4 matrix (and larger) we run into trouble. 

We are almost ready to define the determinant for any square matrix; we need 
one last definition. 


Definition 25 Cofactor Expansion 


Let A be ann x n matrix. 


The cofactor expansion of A along the i‘ row is the sum 


G31C.4 + Oj2C,2 + +++ + AinCin- 
The cofactor expansion of A down the j” column is the sum 


Wn iCry + A2jOrj +--+ An jC: 


The notation of this definition might be a little intimidating, so let’s look at an ex- 
ample. 


Example 70 Let 


Sb 
II 
NAR 
ouwm nN 
On Ww 


Find the cofactor expansions along the second row and down the first column. 


SOLUTION By the definition, the cofactor expansion along the second row is 
the sum 


Q7,1Cr,1 + A2,2Co.2 + A7,3C23. 


(Be sure to compare the above line to the definition of cofactor expansion, and see 
how the “i” in the definition is replaced by “2” here.) 
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We’ll find each cofactor and then compute the sum. 


2 3 we removed the second row and 
8 9 = (—1)(—6) =6 first column of A to compute the 


minor 


C> 1= (-1)?4" 


we removed the second row and ) 


: | = (1)(—12) = —12 ( second column of A to compute 


the minor 


243 1 2 we removed the second row and 
O3= (-1) 7 8 = (—1)(—6) =6 third column of A to compute the 


minor 


Thus the cofactor expansion along the second row is 


21021 + 2,2Co,2 + a7,3C23 = 4(6) + 5(—12) + 6(6) 
= 24— 60+ 36 
=0 


At the moment, we don’t know what to do with this cofactor expansion; we’ve just 


successfully found it. 
We move on to find the cofactor expansion down the first column. By the defini- 


tion, this sum is 
Q4,1C1,1 + 2,1Co.1 + 03,1C3 1. 
(Again, compare this to the above definition and see how we replaced the “j” with “1.”) 
We find each cofactor: 


5 6 d the fi d fi 
ca. s\-Ocies Gare) 
Cr = (—1)?*" ; : | = (—1)(-6) =6 ( we computed this cofactor above ) 

2 3 d the third d fi 
asa, (O53 Gea) 


The cofactor expansion down the first column is 


41,1C1,1 + 021021 + a3,1C31 = 1(—3) + 4(6) + 7(-3) 
=i 9a 91 
=0 


Is it a coincidence that both cofactor expansions were 0? We’ll answer that in a 
while. 
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This section is entitled “The Determinant,” yet we don’t know how to compute it 
yet except for 2 x 2 matrices. We finally define it now. 


Definition 26 The Determinant 


The determinant of ann x n matrix A, denoted det(A) or |A|, 
is a number given by the following: 


¢ ifAisa1 x 1 matrix A = [a], then det (A) =a. 


e ifAisa2 x 2 matrix 


then det (A) = ad — be. 


¢ ifAisann xn matrix, where n > 2, then det(A) is the 
number found by taking the cofactor expansion along 
the first row of A. That is, 


det (A) = a1,1C1,1 + @1,2Ci2 + +++ + G1 nCin- 


Notice that in order to compute the determinant of ann x n matrix, we need to com- 
pute the determinants of n (n— 1) x (n— 1) matrices. This can be a lot of work. We'll 
later learn how to shorten some of this. First, let’s practice. 


Example 71 Find the determinant of 


b 
II 
NBR 
coum N 
on Ww 


SOLUTION Notice that this is the matrix from Example 70. The cofactor ex- 
pansion along the first row is 


det (A) = a4,1C11 + 01,2C1.2 + 41,3C13. 


We’ll compute each cofactor first then take the appropriate sum. 
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Cy = (-1)"**Ar Cy2 = (-1)'*7A1 C13 = (—1)'F7A1 3 
5 6 4 6 4 5 
=1-/3 a. -ay-] 5 7 1/4 8 
= 45 — 48 = (1)(86 = 49) ey ee 


Therefore the determinant of A is 


det (A) = 1(—3) + 2(6) +.3(—3) =0. 


Example 72 Find the determinant of 
3 6 7 
A=|]0 2 -1 
3 -1 1 
SOLUTION We'll compute each cofactor first then find the determinant. 
Cua = (1) "Ar C12 = (-1)'*7A1 2 Ci,3 = (—1)**3A1 3 
2 -1 Oo -1 0 2 
Aa] f mae] ] nes 4 
=2-1 = (-1)(0 +3) a6 
Thus the determinant is 
det (A) = 3(1) + 6(—3) + 7(—6) = —57. 
Example 73 Find the determinant of 
1 2 1 2 
—-1 2 3 4 
a 8 5 -3 1 
5 9 -6 3 
SOLUTION This, quite frankly, will take quite a bit of work. In order to compute 


this determinant, we need to compute 4 minors, each of which requires finding the 
determinant of a 3 x 3 matrix! Complaining won’t get us any closer to the solution, ‘° 


16But it might make us feel a little better. Glance ahead: do you see how much work we have to do?!? 
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so let’s get started. We first compute the cofactors: 


G1 (—1)**"Ai 4 


2 3 4 
=1.- 5 -3 1 eee 
9 -6 3 
-3 1 5 1 5 -3 
pou _4)\it1 . (—4)14+2 .(—4)143 
= 2-(—1) re 3 |t80 5 3 [ta (a) -— 
= 2(—3) + 3(—6) + 4(-3) 
= —36 
Ci2 = (-1)***A12 
—1 3 4 
he d i 
=(=1)*| 8 -3 4 Gimeeanax 
5 -6 3 
-3 1 8 1 8 -3 
= (— —1).(—1)1t1 _4)1+2 -(1)3 
=H [en-cn| 3 3 ]+a-com| 3 3 ]+ecn] 3 78 
—_—_—_—_—_—_—_—_—_—_———<_—&_——_—_—_—K—K—K—K—<—«K—«—«_—<—¥—<€_—¥—§—<—X——S—X—XaJ___-_" 


the determinate of the 3 x 3 matrix 


= 186 


1) [(—1)(—3) + 3(—19) 4 


4(—33)| 


—-1 2 4 
—] 8 5 1 eee all au HSER 
5 9 3 
a 8 1 8 5 
fi _4y\l41 __4y14+2 -(1)143 
=(--cayt] § 5] +2-care| 3 f+a-cae] oo 
= (—1)(6) + 2(—19) + 4(47) 
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—-1 2 3 
= (1) , 8 5 a3 eae aeeenie ee) 
5 9 -6 
5 -3 8 -3 8 
ff _ _4)i4+1 . (—1)1+2 . (—1)14+3 
= [ea-cn] § 3 |+2-com| 3 Tg |+3-cnt| § 
eet TF FO 


the determinate of the 3 x 3 matrix 
= (—1) [(—1)(—3) + 2(33) + 3(47)] 
= —210 


We’ve computed our four cofactors. All that is left is to compute the cofactor ex- 
pansion. 
det (A) = 1(—36) + 2(186) + 1(144) + 2(—210) = 60. 


As a way of “visualizing” this, let’s write out the cofactor expansion again but in- 
cluding the matrices in their place. 


det (A) = 01,111 + 01,2Ci,2 + 01,3013 + G1,4Cr4 


2 3 4 -1 3 4 
=1(-1)?} 5 -3 1]+2(-1)?} 8 -3 1 
9 -6 3 5 -6 3 
+ 
-1 24 -1 2 3 
1(—-1)*| 8 5 1] +2(-1)?| 8 5 -3 
5 9 3 5 9 -6 
= 144 = 210 
= 60 


That certainly took a while; it required more than 50 multiplications (we didn’t 
count the additions). To compute the determinant of a5 x 5 matrix, we’ll need to 
compute the determinants of five 4 x 4 matrices, meaning that we’ll need over 250 
multiplications! Not only is this a lot of work, but there are just too many ways to make 
silly mistakes.‘” There are some tricks to make this job easier, but regardless we see 
the need to employ technology. Even then, technology quickly bogs down. A 25 x 25 
matrix is considered “small” by today’s standards,"® but it is essentially impossible for 
a computer to compute its determinant by only using cofactor expansion; it too needs 
to employ “tricks.” 


17/The author made three when the above example was originally typed. 
18|t is common for mathematicians, scientists and engineers to consider linear systems with thousands 
of equations and variables. 
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In the next section we will learn some of these tricks as we learn some of the 
properties of the determinant. Right now, let’s review the essentials of what we have 
learned. 


1. The determinant of a square matrix is a number that is determined by the matrix. 


2. We find the determinant by computing the cofactor expansion along the first 
row. 


3. To compute the determinant of ann x n matrix, we need to compute n deter- 
minants of (n — 1) x (n — 1) matrices. 


Exercises 3.3 


In Exercises 1-8, find the determinant of the -5 -3 3 
2 x 2 matrix. 11. | -3 3 10 
<3 -9 3 9 
1 10 7 
“lg 9 -6 -4 6 
: 12. —8 0 0 
x18 al AiG. °8- at 
; |-7 8 In Exercises 13 — 24, find the determinant 
rey J of the given matrix using cofactor expansion 
3. 5 a along the first row. 
: 3 2 3 
ri [—10 -1 13. |-6 1 —10 
“| -4 7 -8 -9 -9 
: 8 10 8 -9 -2 
2 3 14 -9 9 -7 
5 -1 9 
g. | 10 -10 a 3-4 
a 15; | =a Bs 3S 
: [1 3 3 -4 5 
|7 7 1 -—2 1 
= 16./5 5 4 
aa. “8 a 0-0 
ines) 
: 1 -4 1 
In Exercises 9-12, a matrix A is given. 17. |0 3 0 
(a) Construct the submatrices used to Bee eee OF 
compute the minors A; 1, A1,2 and A;.3. 3 -1 O 
: 18. |-3 O -4 
(b) Find the cofactors Ci1, Ci2, and C;,3. . : ‘ 
—7 -3 10 =5° ‘0. <4 
i ia ae 1921) aS 
1 6 10 5 0 -4 
—2 -9 6 1 OO 
10 —10 -6 8 20 0 1 0 
0 —-3 -2 -1 11 
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0 oO -1 -1 2 -1 4 4 
1 1 0 1 3 -3 3 2 
ah 1 1 -1 O an 0) 4 -5 1 
-1 0 1 0 —2 -5 -2 -5 
=< 25. LetAbea2 x 2 matrix; 
2 -—1 0 0 1 
: 1 1 1 0 a. iB 
A= 
1 oO -1 -1 ¢ il : 
—-5 1 0 = 60 
—-3 -5 2 5 Show why det(A) = ad—bc by comput- 
23. —2 4 -34 ing the cofactor expansion of A along 
5 4 -3 3 the first row. 


3.4 Properties of the Determinant 


AS YOU READ 


1. Having the choice to compute the determinant of a matrix using cofactor expan- 
sion along any row or column is most useful when there are lots of what in a row 
or column? 


2. Which elementary row operation does not change the determinant of a matrix? 
3. Why do mathematicians rarely smile? 


4. T/F: When computers are used to compute the determinant of a matrix, cofactor 
expansion is rarely used. 


In the previous section we learned how to compute the determinant. In this sec- 
tion we learn some of the properties of the determinant, and this will allow us to 
compute determinants more easily. In the next section we will see one application 
of determinants. 

We start with a theorem that gives us more freedom when computing determi- 
nants. 


Theorem 14 Cofactor Expansion Along Any Row or Column 


Let A be ann x n matrix. The determinant of A can be com- 
puted using cofactor expansion along any row or column of 
A. 


3.4 Properties of the Determinant 


We alluded to this fact way back after Example 70. We had just learned what cofac- 
tor expansion was and we practiced along the second row and down the third column. 
Later, we found the determinant of this matrix by computing the cofactor expansion 
along the first row. In all three cases, we got the number O. This wasn’t a coincidence. 
The above theorem states that all three expansions were actually computing the de- 
terminant. 

How does this help us? By giving us freedom to choose any row or column to use 
for the expansion, we can choose a row or column that looks “most appealing.” This 
usually means “it has lots of zeros.” We demonstrate this principle below. 


Example 74 Find the determinant of 


—1 
NB 
a) 
Ww 
owoo 
ul wo 
—— 


SOLUTION Our first reaction may well be “Oh no! Not another 4 x 4 deter- 
minant!” However, we can use cofactor expansion along any row or column that we 
choose. The third column looks great; it has lots of zeros in it. The cofactor expansion 
along this column is 


det (A) = a1,3C1,3 + a2,3C2,3 + 43,3C3.3 + a4,3Ca3 
open sees ere sir ae 


The wonderful thing here is that three of our cofactors are multiplied by 0. We 
won’t bother computing them since they will not contribute to the determinant. Thus 


det (A) =3- C33 


1 2 9 
=3-(-1)?*?.-| 2 =-3 5 
—4 1 2 
a ee ee we computed the determinant of the 3 x 3 matrix 
= 3 ( 147) ( without showing our work; it is —147 ) 


= —447 


Wow. That was a lot simpler than computing all that we did in Example 73. Of 
course, in that example, we didn’t really have any shortcuts that we could have em- 
ployed. 

Example 75 Find the determinant of 


12 3 4 5 


06 7 8 9 
A=;0 0 10 11 12 
0 0 O 13 14 
00 0 O 15 
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SOLUTION At first glance, we think “I don’t want to find the determinant of 
a5 x 5 matrix!” However, using our newfound knowledge, we see that things are not 
that bad. In fact, this problem is very easy. 

What row or column should we choose to find the determinant along? There are 
two obvious choices: the first column or the last row. Both have 4 zeros in them. We 
choose the first column.!2 We omit most of the cofactor expansion, since most of it is 
just 0: 


s 7.8 
0 10 11 12 
det (A)=1-(-1)""-| 9 9 43 44 
00 0 15 


Similarly, this determinant is not bad to compute; we again choose to use cofac- 
tor expansion along the first column. Note: technically, this cofactor expansion is 
6 - (—1)**1A, 1; we are going to drop the (—1)*** terms from here on out in this 
example (it will show up a lot...). 


10 11 12 
det(A)=1-6-| 0 13 14 
0 oO 15 


You can probably can see a trend. We’ll finish out the steps without explaining 
each one. 


Oo 15 
=1-6-10-13-15 
= 11700 


det(a) =1-6-10.| - is 


We see that the final determinant is the product of the diagonal entries. This works 
for any triangular matrix (and since diagonal matrices are triangular, it works for diag- 
onal matrices as well). This is an important enough idea that we’ll put it into a box. 


Key Idea 12 The Determinant of Triangular Matrices 


The determinant of a triangular matrix is the product of its 
diagonal elements. 


It is now again time to start thinking like a mathematician. Remember, mathe- 
maticians see something new and often ask “How does this relate to things | already 


19We do not choose this because it is the better choice; both options are good. We simply had to make 
a choice. 


3.4 Properties of the Determinant 


know?” So now we ask, “If we change a matrix in some way, how is it’s determinant 
changed?” 

The standard way that we change matrices is through elementary row operations. 
If we perform an elementary row operation on a matrix, how will the determinant of 
the new matrix compare to the determinant of the original matrix? 

Let’s experiment first and then we'll officially state what happens. 


Example 76 Let 


Let B be formed from A by doing one of the following elementary row operations: 
1. 2R, + R2 > Ro 
2. 5Ry — Ri 
3. Ri & Ro 

Find det(A) as well as det(B) for each of the row operations above. 


SOLUTION It is straightforward to compute det (A) = —2. 
Let B be formed by performing the row operation in 1) on A; thus 


It is clear that det (B) = —2, the same as det(A). 
Now let B be formed by performing the elementary row operation in 2) on A; that 


5 10 
s-[5 *). 
We can see that det (B) = —10, which is 5 - det (A). 


Finally, let B be formed by the third row operation given; swap the two rows of A. 
We see that 


is, 


and that det (B) = 2, which is (—1) - det (A). 


We’ve seen in the above example that there seems to be a relationship between 
the determinants of matrices “before and after” being changed by elementary row 
operations. Certainly, one example isn’t enough to base a theory on, and we have not 
proved anything yet. Regardless, the following theorem is true. 
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Theorem 15 The Determinant and Elementary Row Operations 


Let A be ann x n matrix and let B be formed by performing 
one elementary row operation on A. 


1. If B is formed from A by adding a scalar multiple of 


one row to another, then det (B) = det (A). 


. If Bis formed from A by multiplying one row of A bya 
scalar k, then det (B) = k- det (A). 


. If Bis formed from A by interchanging two rows of A, 
then det (B) = —det (A). 


Let’s put this theorem to use in an example. 


Example 77 Let 


b 

ll 
rPOR 
FPN 
PPR 


Compute det(A), then find the determinants of the following matrices by inspection 
using Theorem 15. 


111 121 1 -1 -2 
B=j;1 21 c=j;0 11 D=|]0 1 1 
011 7 7 7 1 1 1 
SOLUTION Computing det(A) by cofactor expansion down the first column or 


along the second row seems like the best choice, utilizing the one zero in the matrix. 
We can quickly confirm that det (A) = 1. 

To compute det(B), notice that the rows of A were rearranged to form B. There are 
different ways to describe what happened; saying R; <> R2 was followed by Ri © R3 
produces B from A. Since there were two row swaps, det (B) = (—1)(—1)det (A) = 
det (A) = 1. 


Notice that Cis formed from A by multiplying the third row by 7. Thus det (C) = 
7+ det (A) =7. 


It takes a little thought, but we can form D from A by the operation —3R2+R1 — Ri. 


This type of elementary row operation does not change determinants, so det (D) = 
det (A). 
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Let’s continue to think like mathematicians; mathematicians tend to remember 
“oroblems” they’ve encountered in the past,2° and when they learn something new, 
in the backs of their minds they try to apply their new knowledge to solve their old 
problem. 

What “problem” did we recently uncover? We stated in the last chapter that even 
computers could not compute the determinant of large matrices with cofactor expan- 
sion. How then can we compute the determinant of large matrices? 

We just learned two interesting and useful facts about matrix determinants. First, 
the determinant of a triangular matrix is easy to compute: just multiply the diagonal 
elements. Secondly, we know how elementary row operations affect the determinant. 
Put these two ideas together: given any square matrix, we can use elementary row op- 
erations to put the matrix in triangular form,2? find the determinant of the new matrix 
(which is easy), and then adjust that number by recalling what elementary operations 
we performed. Let’s practice this. 


Example 78 Find the determinant of A by first putting A into a triangular form, 
where 
2 4 -2 
A=|-1 -—2 5 
3 2 1 
SOLUTION In putting A into a triangular form, we need not worry about get- 


ting leading 1s, but it does tend to make our life easier as we work out a problem by 
hand. So let’s scale the first row by 1/2: 


1 2 -1 
5Ri > Ri —1 —2 5 
3 2 1 
Now let’s get Os below this leading 1: 
Ri +R) >R Se 
1 2 2 
0 Oo 4 
—3R R R 
1+ R3 — R3 0-4 4 


We can finish in one step; by interchanging rows 2 and 3 we'll have our matrix in 
triangular form. 


1 2 -1 
Ro + R3 Oo -4 4 
0 =O 4 


Let’s name this last matrix B. The determinant of B is easy to compute as it is 
triangular; det (B) = —16. We can use this to find det(A). 
Recall the steps we used to transform A into B. They are: 


20which is why mathematicians rarely smile: they are remembering their problems 
21or echelon form 


151 


Chapter 3 Operations on Matrices 


152 


5R1 > R, 
Ri + Ro — Ro 
—3R, +R3— R3 
Ra © R3 


The first operation multiplied a row of A by ;. This means that the resulting matrix 
had a determinant that was ; the determinant of A. 

The next two operations did not affect the determinant at all. The last operation, 
the row swap, changed the sign. Combining these effects, we know that 


1 
—16 = det (B) = (—1) ,det (A). 
Solving for det (A) we have that det (A) = 32. 


In practice, we don’t need to keep track of operations where we add multiples 
of one row to another; they simply do not affect the determinant. Also, in practice, 
these steps are carried out by a computer, and computers don’t care about leading 1s. 
Therefore, row scaling operations are rarely used. The only things to keep track of are 
row swaps, and even then all we care about are the number of row swaps. An odd 
number of row swaps means that the original determinant has the opposite sign of 
the triangular form matrix; an even number of row swaps means they have the same 
determinant. 

Let’s practice this again. 


Example 79 The matrix B was formed from A using the following elementary 
row operations, though not necessarily in this order. Find det(A). 


2R, >R 
ae? 1p, > Rs 
B=|0 4 5 Sais 
006 Ri & Ro 
6R, + Ro > Ro 
SOLUTION It is easy to compute det (B) = 24. In looking at our list of elemen- 


tary row operations, we see that only the first three have an effect on the determinant. 
Therefore 


24 = det (B) = 2- ; | (=1) - det (A) 


and hence 


In the previous example, we may have been tempted to “rebuild” A using the ele- 
mentary row operations and then computing the determinant. This can be done, but 
in general it is a bad idea; it takes too much work and it is too easy to make a mistake. 

Let’s think some more like a mathematician. How does the determinant work with 
other matrix operations that we know? Specifically, how does the determinant in- 
teract with matrix addition, scalar multiplication, matrix multiplication, the transpose 


3.4 Properties of the Determinant 


and the trace? We’ll again do an example to get an idea of what is going on, then give 
a theorem to state what is true. 


1 2 2 1 
A= E A and B= E a 
Find the determinants of the matrices A, B, A + B, 3A, AB, A’, A+, and compare the 
determinant of these matrices to their trace. 


Example 80 Let 


SOLUTION We can quickly compute that det (A) = —2 and that det (B) = 7. 
1 2 2 1 
det (A — B) = det (c | _ E al) 
2 
~} 0 =-1 
=1 


It’s tough to find a connection between det(A — B), det(A) and det(B). 


det (24) = | ae | 
= —18 


We can figure this one out; multiplying one row of A by 3 increases the determinant 
by a factor of 3; doing it again (and hence multiplying both rows by 3) increases the 
determinant again by a factor of 3. Therefore det (3A) = 3-3 - det (A), or 37 -A. 


sea) — ace ([? 2] [2 2)) 


| 8 1 
18 23 
=-14 


This one seems clear; det (AB) = det (A) det (B). 
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Obviously det (A") = det (A); is this always going to be the case? If we think about 
it, we can see that the cofactor expansion along the first row of A will give us the same 
result as the cofactor expansion along the first column of A’.22 


-1) _| —2 1 
ae = 3/2 =f 
= 1-3/2 
=-—1/2 
It seems as though 
1 
det (A~*) = ’ 
ena) det (A) 


We end by remarking that there seems to be no connection whatsoever between 
the trace of a matrix and its determinant. We leave it to the reader to compute the 
trace for some of the above matrices and confirm this statement. 


We now state a theorem which will confirm our conjectures from the previous 
example. 


Theorem 16 Determinant Properties 


Let A and B be n Xx n matrices and let k be a scalar. The 
following are true: 


. det (kA) = k” - det (A) 

_ det (A’) = det (A) 

. det (AB) = det (A) det (B) 
. IfAis invertible, then 


= 1 
~ det (A)’ 


det (A~*) 


. A matrix A is invertible if and only if det (A) 4 0. 


This last statement of the above theorem is significant: what happens if det (A) = 
0? It seems that det (A~*) =“1/0", which is undefined. There actually isn’t a problem 
here; it turns out that if det (A) = 0, then A is not invertible (hence part 5 of Theorem 
16). This allows us to add on to our Invertible Matrix Theorem. 


22This can be a bit tricky to think out in your head. Try it with a 3x3 matrix A and see how it works. All 
the 2 x 2 submatrices that are created in A’ are the transpose of those found in A; this doesn’t matter since 
it is easy to see that the determinant isn’t affected by the transpose in a 2 x 2 matrix. 
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Theorem 17 Invertible Matrix Theorem 


Let A be ann xn matrix. The following statements are equiv- 
alent. 


(a) A is invertible. 


(g) det (A) £0. 


This new addition to the Invertible Matrix Theorem is very useful; we’ll refer back 
to it in Chapter 4 when we discuss eigenvalues. 


We end this section with a shortcut for computing the determinants of 3 x 3 ma- 
trices. Consider the matrix A: 


N BPR 
com nN 
OD Ww 


We can compute its determinant using cofactor expansion as we did in Example 71. 
Once one becomes proficient at this method, computing the determinant of a 3 x 3 
isn’t all that hard. A method many find easier, though, starts with rewriting the matrix 
without the brackets, and repeating the first and second columns at the end as shown 
below. 


N BR 
oun 
OD Ww 
N BPR 
oun 


In this 3 x 5 array of numbers, there are 3 full “upper left to lower right” diagonals, 
and 3 full “upper right to lower left” diagonals, as shown below with the arrows. 


105 48 72 45 84 96 
The numbers that appear at the ends of each of the arrows are computed by mul- 
tiplying the numbers found along the arrows. For instance, the 105 comes from mul- 
tiplying 3-5-7 = 105. The determinant is found by adding the numbers on the right, 
and subtracting the sum of the numbers on the left. That is, 
det (A) = (45 + 84+ 96) — (105 + 48 + 72) =0. 


To help remind ourselves of this shortcut, we’ll make it into a Key Idea. 
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Key Idea 13 3 x 3 Determinant Shortcut 


Let A be a3 x 3 matrix. Create a3 x 5 array by repeating 
the first 2 columns and consider the products of the 3 “right 
hand” diagonals and 3 “left hand” diagonals as shown pre- 


viously. Then 


det(A) = “(the sum of the right hand numbers) 
— (the sum of the left hand numbers)”. 


We'll practice once more in the context of an example. 


Example 81 Find the determinant of A using the previously described shortcut, 
where 
1 3 9 
A=j|-2 3 4 
—-5 7 2 
SOLUTION Rewriting the first 2 columns, drawing the proper diagonals, and 


multiplying, we get: 


—135 28—12 6 —60 —126 


Summing the numbers on the right and subtracting the sum of the numbers on the 
left, we get 


det (A) = (6 — 60 — 126) — (—135 + 28 — 12) = —61. 


In the next section we’ll see how the determinant can be used to solve systems of 
linear equations. 


Exercises 3.4 


In Exercises 1 —- 14, find the determinant 4 4 —4 
of the given matrix using cofactor expansion 2. 0 0 —3 
along any row or column you choose. —2 -2 -1 
1 2 3 -4 1 1 
1.)-5 0 3 3. 0 0 0 
4 0 6 —-1 -—2 -5 


O: 3o7-4 
4.|/0 0 5 
Sh 4 06 
9 28 °5 
ae ca 
=i 20° 0 
Ae 29+ --H 
6 [2 S858 
=§ t 0 
=9 <0. -=5 
7a a 8 
=f 0° 4 
ae 
ran i a 
ae a 
5) air r> 74 
2 a = ad 
PLES og 59. oe 
af a8 Or 45 
AS: Bg 
0 ° O° “69 
oe toy ee ee ae 
0 0 2 0 
6 26. 0: 9 
0 0 8 0 
oan a ee 
SAM at as eB 
af 0." -a8g 6G 
g 6 oT ae 
Be Wehy nde te 
a 0 6e -0 
40510 
10315 
133.) 2) 22 2 
10000 
442 5 3 
ye eae (gees Te 
Wo 95.0%. 2 
14./0 0100 
1.32 0.3 
505 04 


In Exercises 15 - 18, a matrix VM and det(V) 
are given. Matrices A, B and C are formed by 
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performing operations on /. Determine the 
determinants of A, B and C using Theorems 
15 and 16, and indicate the operations used 
to form A, B and C. 


Ge 3 25 
15.M=|/3 1 O|, 
a eer | 
det(M) = —41 
0 3 5 
(a) A=|]—-2 -4 -1 
a 3k, 
0 3 5 
(6) B=]3 1 O 
8 16 4 
3 4 «5 
c)c=]3 1 0 
= ae ar 
9 7 8 
16.M={1 3 7I, 
6 3 3 
det(M) = 45 
18 14 16 
(a) A=]1 3 7 
6 3 3 
9 7 8 
(ob) B=}|1 3 7 
96 73 83 
9 1 6 
c) C=|7 3 3 
8 7 3 
5 15 
17.M=1|/4 0 2], 
00 4 
det(M) = —16 
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5 4 0 pe b 0 
18.M=]7 9 3], 4 —-4 
13 9 . ; 2 
det(M) = 120 In Exercises 23 - 30, find the determinant of 
the given matrix using Key Idea 13. 
13 9 
(a) A~|7 9 3 3 2 3 
5 4 0 23. —6 1 —10 
-8 -9 -9 
5 4 0 
(b) B= |14 18 6 8 -9 -2 
3 9 27 24. —9 9 —7 
5 _4 0 5 —1 9 
(c) C=|-7 -9 -3 a 
Sp Aaa ee Sued! eho a 
In Exercises 19 — 22, matrices A and B are 3 —4 #5 
given. Verify part 3 of Theorem 16 by com- 
puting det(A), det(B) and det(A8). | 1-2 1 
5, a 26. | 5 5 4 
19. A= le HE 4 0 0 
fo —4 1 -4 1 
B=|, 3 a7 ho? <8! 9G 
3 5 ‘ 1 2 2 
20. A= E 1 | 3 4 0 
[4 -1 28. }-3 O —-4 
B=! _. 3 a 
—4 4 —5 0 -4 
Ree 5 aA 29. | 2 4 -1 
es = _ 5 0 4 
5 -3 
- 1 0 0 
—-3 -1 30. 0 1 O 
2 Ae ai [° a 
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3.5  Cramer’s Rule 


AS YOU READ 


1. T/F: Cramer’s Rule is another method to compute the determinant of a matrix. 


2. T/F: Cramer’s Rule is often used because it is more efficient than Gaussian elim- 
ination. 


3. Mathematicians use what word to describe the connections between seemingly 
unrelated ideas? 


In the previous sections we have learned about the determinant, but we haven’t 
given a really good reason why we would want to compute it.2? This section shows one 
application of the determinant: solving systems of linear equations. We introduce this 
idea in terms of a theorem, then we will practice. 


Theorem 18 Cramer’s Rule 


Let A be ann x n matrix with det (A) 4 0 and let b be an 
n xX 1 column vector. Then the linear system 


Ax =b 


has solution 


det (4i(6)) 


det(A) ” 


xi = 


where Ai(b) is the matrix formed by replacing the i column 
of A with b. 


Let’s do an example. 


Example 82 Use Cramer’s Rule to solve the linear system AX = b where 
1 5 -3 —36 
A=|1 4 2 and b= | —11 
2 -1 O 7 


3The closest we came to motivation is that if det (A) = 0, then we know that A is not invertible. But it 
seems that there may be easier ways to check. 
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SOLUTION We first compute the determinant of A to see if we can apply Cramer’s 
Rule. 
1 5 -3 
det(A)=| 1 4 2 |=49. 
2 -1 O 


Since det (A) 4 0, we can apply Cramer’s Rule. Following Theorem 18, we com- 
pute det(i(6)), det(2(6)) and det(43(6)). 


=o6 kt 3 
det (4:(6)) =| 4% 4 2 || a9, 
7 mt 


(We used a bold font to show where b replaced the first column of A.) 


7 1 -36 -—3 
det (42(6)) =i 4> qa. @. || cep Ag: 
2 7 0 
1 5 —36 
det (43(6)) =|1 4 —11 |=196. 
2 -1 7 
Therefore we can compute x: 
xy = = =1 
det (A) 49 
oe det(A) 49 
det (43 (6)) 196 
— = — 4 
8 det (A) 49 
Therefore 
X1 1 
x= X2 = —5 
X3 4 
Let’s do another example. 
Example 83 Use Cramer’s Rule to solve the linear system Ax = b where 


1 2 > —1 
a=|3 ;| and B= |". 
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SOLUTION The determinant of A is —2, so we can apply Cramer’s Rule. 
det (4:(6)) = | Fe ; | 226 
Therefore 
= = —— 3 
a det (A) =) 
det (42(6)) i 
= = =-2 
0 det (A) a2 
and 


We learned in Section 3.4 that when considering a linear system Ax = b where A 
is square, if det (A) 4 0 then A is invertible and Ax = b has exactly one solution. We 
also stated in Key Idea 11 that if det (A) = 0, then A is not invertible and so therefore 
either Ax = b has no solution or infinite solutions. Our method of figuring out which 
of these cases applied was to form the augmented matrix [A b| , put it into reduced 
row echelon form, and then interpret the results. 

Cramer’s Rule specifies that det (A) 4 0 (so we are guaranteed a solution). When 
det (A) = 0 we are not able to discern whether infinite solutions or no solution exists 
for a given vector b. Cramer’s Rule is only applicable to the case when exactly one 
solution exists. 


We end this section with a practical consideration. We have mentioned before 
that finding determinants is a computationally intensive operation. To solve a linear 
system with 3 equations and 3 unknowns, we need to compute 4 determinants. Just 
think: with 10 equations and 10 unknowns, we’d need to compute 11 really hard de- 
terminants of 10 x 10 matrices! That is a lot of work! 

The upshot of this is that Cramer’s Rule makes for a poor choice in solving nu- 
merical linear systems. It simply is not done in practice; it is hard to beat Gaussian 
elimination.24 

So why include it? Because its truth is amazing. The determinant is a very strange 
operation; it produces a number in a very odd way. It should seem incredible to the 


4A version of Cramer’s Rule is often taught in introductory differential equations courses as it can be 
used to find solutions to certain linear differential equations. In this situation, the entries of the matrices 
are functions, not numbers, and hence computing determinants is easier than using Gaussian elimination. 
Again, though, as the matrices get large, other solution methods are resorted to. 
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reader that by manipulating determinants in a particular way, we can solve linear sys- 


tems. 


In the next chapter we’ll see another use for the determinant. Meanwhile, try to 
develop a deeper appreciation of math: odd, complicated things that seem completely 
unrelated often are intricately tied together. Mathematicians see these connections 
and describe them as “beautiful.” 


Exercises 3.5 


In Exercises 1-12, matrices A and bare given. 
(a) Give det(A) and det(A;) for all i. 


(b) Use Cramer’s Rule to solve AX = b. If 


Cramer’s Rule cannot be used to find 
the solution, then state whether or not 
a solution exists. 


[7 -7 ~ 28 
wa=[7 Gg [ pe pea 
9 5 ~ —45 
ier | mA B=|0| 
-8 16 ~ —48 
a al B= || 
0 —-6 ~ 6 
TDS Nig #7 ve Ea 
[2 10 ~ 42 
Pe | eae gt 149 
i 
7 14 > =i 
aS |s, eal el 
3 0 -3 24 
7,A=/15 4 4], b=]|o0 
5 5 -4 31 
4 9 3 
8 A=]-5 —2 -13], 
—1 10 —13 
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10. 


11. 


12. 


—28 
b= | 35 
7 
4. =4 0 16 
A=|5 1 -1|, b=|22 
S ok 2 8 
1) 02. 40 
A=|4 -3 -10], 
9 (6. a 
—40 
b= | -94 
132 
Ph “35 
As lade Be a7, 
9 -7 34 
=f 
b= | -3 
5 
Sb. Fs 7 
A=|5 4 11], 
5 4 8 
58 
b= | —35 
—49 
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EIGENVALUES AND EIGENVECTORS 


We have often explored new ideas in matrix algebra by making connections to our 
previous algebraic experience. Adding two numbers, x + y, led us to adding vectors 
X + Yand adding matrices A + B. We explored multiplication, which then led us to 
solving the matrix equation AX = b, which was reminiscent of solving the algebra 
equation ax = b. 

This chapter is motivated by another analogy. Consider: when we multiply an un- 
known number x by another number such as 5, what do we know about the result? 
Unless, x = 0, we know that in some sense 5x will be “5 times bigger than x.” Applying 
this to vectors, we would readily agree that 5X gives a vector that is “5 times bigger 
than x.” Each entry in x is multiplied by 5. 

Within the matrix algebra context, though, we have two types of multiplication: 
scalar and matrix multiplication. What happens to x when we multiply it by a matrix 
A? Our first response is likely along the lines of “You just get another vector. There is 
no definable relationship.” We might wonder if there is ever the case where a matrix 
— vector multiplication is very similar to a scalar — vector multiplication. That is, do we 
ever have the case where Ax = ax, where a is some scalar? That is the motivating 
question of this chapter. 


4.1 Eigenvalues and Eigenvectors 


AS YOU READ 


1. T/F: Given any matrix A, we can always find a vector X where Ax = x. 
2. When is the zero vector an eigenvector for a matrix? 
3. If Vis an eigenvector of a matrix A with eigenvalue of 2, then what is AV? 


4. T/F:|f Aisa5 x 5 matrix, to find the eigenvalues of A, we would need to find the 
roots of a 5‘ degree polynomial. 
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We start by considering the matrix A and vector x as given below.? 


“Ey =f 


Multiplying Ax gives: 


Wow! It looks like multiplying Ax is the same as 5x! This makes us wonder lots 
of things: Is this the only case in the world where something like this happens?* Is A 
somehow a special matrix, and Ax = 5x for any vector X we pick?? Or maybe xX was a 
special vector, and no matter what 2 x 2 matrix A we picked, we would have Ax = 5x.* 

A more likely explanation is this: given the matrix A, the number 5 and the vector 
X formed a special pair that happened to work together in a nice way. It is then natural 
to wonder if other “special” pairs exist. For instance, could we find a vector x where 
AX = 3x? 

This equation is hard to solve at first; we are not used to matrix equations where 
X appears on both sides of “=.” Therefore we put off solving this for just a moment to 
state a definition and make a few comments. 


Definition 27 Eigenvalues and Eigenvectors 


Let A be ann x n matrix, X a nonzero n x 1 column vector 


and J ascalar. If 
AX — XX, 


then x is an eigenvector of A and 2 is an eigenvalue of A. 


The word “eigen” is German for “proper” or “characteristic.” Therefore, an eigen- 
vector of Ais a “characteristic vector of A.” This vector tells us something about A. 

Why do we use the Greek letter \ (lambda)? It is pure tradition. Above, we used a 
to represent the unknown scalar, since we are used to that notation. We now switch 
to because that is how everyone else does it.” Don’t get hung up on this; ) is just a 
number. 


1Recall this matrix and vector were used in Example 40 on page 75. 
2Probably not. 

3 Probably not. 

4See footnote 2. 

5An example of mathematical peer pressure. 
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Note that our definition requires that A be a square matrix. If A isn’t square then 
Ax and AX will have different sizes, and so they cannot be equal. Also note that X must 
be nonzero. Why? What if X = 0? Then no matter what is, AX = AX. This would 
then imply that every number is an eigenvalue; if every number is an eigenvalue, then 
we wouldn’t need a definition for it. Therefore we specify that x 4 0. 

Our last comment before trying to find eigenvalues and eigenvectors for given ma- 
trices deals with “why we care.” Did we stumble upon a mathematical curiosity, or 
does this somehow help us build better bridges, heal the sick, send astronauts into 
orbit, design optical equipment, and understand quantum mechanics? The answer, of 
course, is “Yes.”” This is a wonderful topic in and of itself: we need no external applica- 
tion to appreciate its worth. At the same time, it has many, many applications to “the 
real world.” A simple Internet seach on “applications of eigenvalues” with confirm this. 

Back to our math. Given a square matrix A, we want to find a nonzero vector x 
and a scalar such that AX = AX. We will solve this using the skills we developed in 
Chapter 2. 


AX = AX original equation 
Ax—r»X = O subtract \x from both sides 
(A = M)X O factor out X 


Think about this last factorization. We are likely tempted to say 
AX — »X = (A— X)X, 


but this really doesn’t make sense. After all, what does “a matrix minus a number” 
mean? We need the identity matrix in order for this to be logical. 

Let us now think about the equation (A — \/)X = 0. While it looks complicated, it 
really is just matrix equation of the type we solved in Section 2.4. We are just trying 
to solve Bx = 0, where B = (A — 2). 

We know from our previous work that this type of equation® always has a solution, 
namely, X = 0. However, we want x to be an eigenvector and, by the definition, 
eigenvectors cannot be 0. 

This means that we want solutions to (A — \/)Xx = 0 other than x = O. Recall 
that Theorem 8 says that if the matrix (A — /) is invertible, then the only solution to 
(A — \I)xX = Ois X = 0. Therefore, in order to have other solutions, we need (A — 2/) 
to not be invertible. 

Finally, recall from Theorem 16 that noninvertible matrices all have a determi- 
nant of 0. Therefore, if we want to find eigenvalues \ and eigenvectors x, we need 
det (A — Al) = 0. 

Let’s start our practice of this theory by finding \ such that det (A — A/) = 0; that 
is, let’s find the eigenvalues of a matrix. 


Recall footnote 17 on page 107. 

7Except for the “understand quantum mechanics” part. Nobody truly understands that stuff; they just 
probably understand it. 

8Recall this is a homogeneous system of equations. 
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Example 84 Find the eigenvalues of A, that is, find \ such that det (A — A/) = 0, 
where 
_|1 4 
~ 12 3]¢ 
SOLUTION (Note that this is the matrix we used at the beginning of this sec- 


tion.) First, we write out what A — A/ is: 
1 4 10 
a-u=[> s|-als fl 

_f1 4) [A 0 
~ 12 3 0 A 
_f1-A 4 

_ 2 3-— 

Therefore, 


det(a— ay =| 77% ‘ | 


2 wad 
(La Har 8 
=)? —-4\—5 


l| 


Since we want det (A — \/) = 0, we want \* — 4\ — 5 = 0. This is a simple 
quadratic equation that is easy to factor: 


-—4\-5=0 
(A—5)(A+1) =0 
A=-1,5 


According to our above work, det (A — \/) = 0 when A = —1, 5. Thus, the eigen- 
values of A are —1 and 5. 


Earlier, when looking at the same matrix as used in our example, we wondered if 
we could find a vector X such that AX = 3x. According to this example, the answer is 
“No.” With this matrix A, the only values of \ that work are —1 and 5. 

Let’s restate the above in a different way: It is pointless to try to find x where 
Ax = 3x, for there is no such X. There are only 2 equations of this form that have a 
solution, namely 


> 


Ax = —x and AX = 5X. 


As we introduced this section, we gave a vector x such that Ax = 5x. Is this the 
only one? Let’s find out while calling our work an example; this will amount to finding 
the eigenvectors of A that correspond to the eigenvector of 5. 
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Example 85 Find X such that Ax = 5x, where 
1 4 
jl), 
SOLUTION Recall that our algebra from before showed that if 


AX =x then (A—AX)x=0. 


Therefore, we need to solve the equation (A — \/)x = 0 for X when A = 5. 


a-sie[> S]-s[0 1] 
-|7 4] 


To solve (A—5/)x = 0, we form the augmented matrix and put it into reduced row 
echelon form: 


—2 0 ney 0 0 0 


| me | 
| 
Yon 


4 4 — E —1 | 
Thus 


X, = X2 


X2 is free 


-(2]--(} 


We have infinite solutions to the equation Ax = 5x; any nonzero scalar multiple of the 


and 


1]. : : ‘ 
| is a solution. We can do a few examples to confirm this: 


[> 3] [2]= [0] ==[2) 


vector | 


1 4] /-3]_[-15]_.[-3 
2 3}]-3] | -15] — —3]° 
Our method of finding the eigenvalues of a matrix A boils down to determining 
which values of A give the matrix (A—A/) a determinant of 0. In computing det (A — X/), 
we get a polynomial in \ whose roots are the eigenvalues of A. This polynomial is im- 


portant and so it gets its own name. 
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Definition 28 Characteristic Polynomial 


Let A be ann x n matrix. The characteristic polynomial of A 
is the n“ degree polynomial p(\) = det (A — XI). 


Our definition just states what the characteristic polynomial is. We know from our 
work so far why we care: the roots of the characteristic polynomial of ann x n matrix 
A are the eigenvalues of A. 

In Examples 84 and 85, we found eigenvalues and eigenvectors, respectively, of 
a given matrix. That is, given a matrix A, we found values \ and vectors X such that 
Ax = 2X. The steps that follow outline the general procedure for finding eigenvalues 
and eigenvectors; we'll follow this up with some examples. 


Key Idea 14 Finding Eigenvalues and Eigenvectors 


Let A be ann x n matrix. 


1. To find the eigenvalues of A, compute p(A), the char- 
acteristic polynomial of A, set it equal to 0, then solve 
for X. 


2. To find the eigenvectors of A, for each eigenvalue 
solve the homogeneous system (A — \I)x = 0. 


Example 86 Find the eigenvalues of A, and for each eigenvalue, find an eigen- 
vector where 

—3 #15 

Pal 
SOLUTION To find the eigenvalues, we must compute det (A — A/) and set it 

equal to 0. 

—3- xX 15 

det(A— ©) =| 3 ra 


(9) 2) 45 
=)? — 6-27-45 
=)? 6-72 

= (A—12)(A +6) 


Therefore, det (A — A!) = Owhen A = —6 and 12; these are our eigenvalues. (We 
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should note that p(A) = d” — 6\ — 72 is our characteristic polynomial.) It sometimes 


helps to give them “names,” so we'll say Ay = —6 and Az = 12. Now we find eigen- 
vectors. 
For A; = —6: 


We need to solve the equation (A — (—6)/)x = 0. To do this, we form the appro- 
priate augmented matrix and put it into reduced row echelon form. 


3 15 O ee 15 0 
3 15 6 ne 0 0 Oo] 
Our solution is 

X1 = —5x2 


X2 is free; 


xX =X? 1 . 


We may pick any nonzero value for x2 to get an eigenvector; a simple option is xz = 1. 
Thus we have the eigenvector 


in vector form, we have 


(We used the notation x; to associate this eigenvector with the eigenvalue ),.) 


We now repeat this process to find an eigenvector for Az = 12: 
In solving (A — 12/)xX = 0, we find 


—15 15 O nee 1 -1 0 
3 -3 0 0 O O 
In vector form, we have 
xX=x : 
=|4|- 
Again, we may pick any nonzero value for x2, and so we choose x2 = 1. Thus an 
eigenvector for Az is 
cult 
2 ll glee 
To summarize, we have: 
: : ‘ ~ —5 
eigenvalue \; = —6 with eigenvector x; = 1 


and 


= 1 
eigenvalue Az = 12 with eigenvector x2 = | : 
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We should take a moment and check our work: is it true that Ax; = 1x7? 
= —3 15 —5 
“i= [2 2] [ 2] 
_ | 30 
~ |—6 
—5 
-o[z 


= 1X4. 
Yes; it appears we have truly found an eigenvalue/eigenvector pair for the matrix A. 


Let’s do another example. 


-3 0 
Example 87 LetA = | 5 | . Find the eigenvalues of A and an eigenvector 


for each eigenvalue. 


SOLUTION We first compute the characteristic polynomial, set it equal to 0, 
then solve for X. 


—3- xX 0 
5 1-> 
(—3 — A)(1— A) 


det (A — Al) = | 


From this, we see that det (A — A/) = O when \ = —3,1. We'll set \; = —3 and 
A2 = 1. 


Finding an eigenvector for 1: 
We solve (A — (—3)/)xX = O for xX by row reducing the appropriate matrix: 


sss] wm eg 


5 4 0 ee 0 0 0 


Our solution, in vector form, is 


Again, we can pick any nonzero value for x2; a nice choice would eliminate the fraction. 
Therefore we pick x2 = 4, and find 


Finding an eigenvector for A2: 
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We solve (A — (1)/)x = O for x by row reducing the appropriate matrix: 


-—4 0 0 ree 1 0 0 
5 0 0 0 0 O|° 


We’ve seen a matrix like this before,” but we may need a bit of a refreshing. Our 
first row tells us that x; = O, and we see that no rows/equations involve x2. We con- 
clude that x2 is free. Therefore, our solution, in vector form, is 


ea 
X = X2 1]° 


We pick x2 = 1, and find 


To summarize, we have: 
. ‘ ‘ = —5 
eigenvalue \; = —3 with eigenvector x; = 


and 


_ 0 
eigenvalue \2 = 1 with eigenvector x) = | : 


So far, our examples have involved 2 x 2 matrices. Let’s do an example with a 3 x 3 
matrix. 


Example 88 Find the eigenvalues of A, and for each eigenvalue, give one eigen- 
vector, where 
—7 -—2 10 
A=j|-3 2 3 
-6 -2 9 
SOLUTION We first compute the characteristic polynomial, set it equal to 0, 


then solve for A. A warning: this process is rather long. We’ll use cofactor expansion 
along the first row; don’t get bogged down with the arithmetic that comes from each 
step; just try to get the basic idea of what was done from step to step. 


9See page 31. Our future need of knowing how to handle this situation is foretold in footnote 5. 
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lI 

| 
w 
N 
| 
- 
w 


det (A — AI) 


= 9 |- 4] 23 salto] oe 9 
= (-7—A)(A? — 114 + 24) + 2(34 — 9) + 10(—6) + 18) 
=—-\3+4)-r-6 

= —(A+1)(A— 2)(A—-3) 


(-7-A) 


In the last step we factored the characteristic polynomial —\? +4? — \—6. Factoring 
polynomials of degree > 2 is not trivial; we’ll assume the reader has access to methods 
for doing this accurately. 

Our eigenvalues are A; = —1, A2 = 2 and \3 = 3. We now find corresponding 
eigenvectors. 


For A, = —1: 


We need to solve the equation (A — (—1)!)X = 0. To do this, we form the appro- 
priate augmented matrix and put it into reduced row echelon form. 


—6 -—2 10 O 10 -15 0 
—3 3 3 0 rref 01 —-5 O 
—6 -—2 10 0O 0 O 0 0 


Our solution, in vector form, is 


We can pick any nonzero value for x3; a nice choice would get rid of the fractions. 
3 


So we’ll set x3 = 2 and choose x; = | 1 | as our eigenvector. 
2 


For Ap = 2: 


We need to solve the equation (A — 2/)x = 0. To do this, we form the appropriate 
augmented matrix and put it into reduced row echelon form. 


10You probably learned how to do this in an algebra course. As a reminder, possible roots can be found 
by factoring the constant term (in this case, —6) of the polynomial. That is, the roots of this equation could 
be +1, +2, +3 and +6. That’s 12 things to check. 

One could also graph this polynomial to find the roots. Graphing will show us that A = 3 Jooks like a root, 
and a simple calculation will confirm that it is. 


4.1 Eigenvalues and Eigenvectors 


-—9 -—2 10 0 10 -1 0 
— 

—3 O 3 0 rref 01 -5 O 

—6 —-2 7 0O 00 0 0 


Our solution, in vector form, is 
x= X3 1/2 


We can pick any nonzero value for x3; again, a nice choice would get rid of the frac- 
2 


tions. So we’ll set x3 = 2 and choose x) = | 1 | as our eigenvector. 
2 


For A3 = 3: 


We need to solve the equation (A — 3/)x = 0. To do this, we form the appropriate 
augmented matrix and put it into reduced row echelon form. 


—10 -2 10 0O 1 0 -1 0 
— 
—3 -1 3 0 rref |}O 1 0 0 
—-6 -2 6 0 00 0 0 
Our solution, in vector form, is (note that x, = 0): 
1 
X =X3 0 
1 
1 
We can pick any nonzero value for x3; an easy choice is x3 = 1, so x3 = | 0] as 
1 


our eigenvector. 


To summarize, we have the following eigenvalue/eigenvector pairs: 


3 

eigenvalue A, = —1 with eigenvector x; = | 1 
2 

eigenvalue \2 = 2 with eigenvector x = 


eigenvalue \3 = 3 with eigenvector x3 = 


| 
| 


POF NR N 


Let’s practice once more. 
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Example 89 Find the eigenvalues of A, and for each eigenvalue, give one eigen- 
vector, where 
2 -1 1 
A=|0 1 6 
0 3 4 
SOLUTION We first compute the characteristic polynomial, set it equal to 0, 
then solve for A. We’ll use cofactor expansion down the first column (since it has lots 


of zeros). 


det (A — Al) 


I 
jo) 
bh 

| 
»- 
a 


lI 
— 
N 

| 
~ 
ae: 


= (2—A)(A? — 5A — 14) 
= (2— A)(A—7)(A +2) 


Notice that while the characteristic polynomial is cubic, we never actually saw a 
cubic; we never distributed the (2 — A) across the quadratic. Instead, we realized that 
this was a factor of the cubic, and just factored the remaining quadratic. (This makes 
this example quite a bit simpler than the previous example.) 

Our eigenvalues are A; = —2, Ap = 2 and \3 = 7. We now find corresponding 
eigenvectors. 


For Ay = —2: 


We need to solve the equation (A — (—2)!)x = 0. To do this, we form the appro- 
priate augmented matrix and put it into reduced row echelon form. 


4-110 1 0 3/4 0 
oy 
0 3 60 rref 01 2 +40 
0 3 60 00 0 0 
Our solution, in vector form, is 
—3/4 
x = X3 —2 
1 
We can pick any nonzero value for x3; a nice choice would get rid of the fractions. 
—3 
So we'll set x3 = 4 and choose x; = | —8 | as our eigenvector. 
4 
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For Az = 2: 


We need to solve the equation (A — 2/)x = 0. To do this, we form the appropriate 
augmented matrix and put it into reduced row echelon form. 


0 -1 140 a 0100 
0 -1 6 0 rref 0010 
0 3 20 00 0 0 


This looks funny, so we’ll look remind ourselves how to solve this. The first two 
rows tell us that x. = O and x3 = O, respectively. Notice that no row/equation uses x1; 
we conclude that it is free. Therefore, our solution in vector form is 


1 
X=X, 0 
0 


We can pick any nonzero value for x,; an easy choice is x; = 1 and choose x5 = 
1 
O | as our eigenvector. 
0 


For \3 = 7: 


We need to solve the equation (A — 7/)x = 0. To do this, we form the appropriate 
augmented matrix and put it into reduced row echelon form. 


—-5 -1 1 O = 1 0 
O -6 6 O rref O01 -1 0 
0 3 -3 0 0 0 


Our solution, in vector form, is (note that x; = 0): 


0 
x= X3 1 
1 
0 
We can pick any nonzero value for x3; an easy choice is x3 = 1, so x3 = | 1] as 
1 


our eigenvector. 


To summarize, we have the following eigenvalue/eigenvector pairs: 


3 
eigenvalue \; = —2 with eigenvector x; = | —8 
4 
1 
eigenvalue \2 = 2 with eigenvector x2 = | 0 
0) 
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0 
eigenvalue A3 = 7 with eigenvector x3 = | 1 
1 


In this section we have learned about a new concept: given a matrix A we can find 
certain values \ and vectors X where AX = \x. In the next section we will continue to 
the pattern we have established in this text: after learning a new concept, we see how 
it interacts with other concepts we know about. That is, we’ll look for connections 
between eigenvalues and eigenvectors and things like the inverse, determinants, the 
trace, the transpose, etc. 


Exercises 4.1 


In Exercises 1-6, a matrix A and one of its —1 
eigenvectors are given. Find the eigenvalue X= 1 
of A for the given eigenvector. 1 
. 9 3 In Exercises 7 — 11, a matrix A and one of its 
1,.A= 6 5 eigenvalues are given. Find an eigenvector of 
b A for the given eigenvalue. 
—4 
x= 16 6 
Fa nae[, 4 
Aa |. <8 os 
| 48 —15 fea be 6 
. fa -9 13 
: —16 -—28 —19 
4 aa) a 9.A—| 42 69 46 
|-2 4 —42 -72 —49 
= 2 A=5 
a | 7 _=G. 2210 
; 10.A=|6 2 -6 
—11 -19 14 2 -5 —5 
4.A=}|-6 -8 6 \=_-3 
—12 -22 15 
4 5 -3 
Pe 11.A=|-7 -8 3 
x= 1/2 1 -5 8 
a \=2 
—7 1 3 In Exercises 12 — 28, find the eigenvalues of 
5. A= 10 2 —3 the given matrix. For each eigenvalue, give an 
—20 -14 1 eigenvector. 
1 —1 -4 
ze | 5 12. = = 
a [4 72 
13. a | 
—12 -10 0 L 
cal 15 13 O a [2 | 
15 18 —5 "|2  —-8 


20. 


21. 


22: 


4.2 


4.2 Properties of Eigenvalues and Eigenvectors 


[3 12 1 O —18 
}1  —1 23. |-4 3 -1 
rs. s 1 0 -8 
= 3 -1 18 0 
[3  -1 24. 1 2 0 
1 3 5 -3 -1 


lo | 
25 0 25. 


27. 


—5 0 28. 


PNP COM COR oo 
| 
N 


SSS an. th 


| 

| 
2-3 26 E 36 

| 

| 


Properties of Eigenvalues and Eigenvectors 


AS YOU READ 


1. 


2. 


T/F: A and A’ have the same eigenvectors. 

T/F: A and A~* have the same eigenvalues. 

T/F: Marie Ennemond Camille Jordan was a guy. 

T/F: Matrices with a trace of 0 are important, although we haven’t seen why. 


T/F: A matrix A is invertible only if 1 is an eigenvalue of A. 


In this section we’ll explore how the eigenvalues and eigenvectors of a matrix relate 
to other properties of that matrix. This section is essentially a hodgepodge of inter- 
esting facts about eigenvalues; the goal here is not to memorize various facts about 
matrix algebra, but to again be amazed at the many connections between mathemat- 
ical concepts. 

We’|l begin our investigations with an example that will give a foundation for other 
discoveries. 
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1 2 3 
Example 90 LetA=j|0O 4 5). Find the eigenvalues of A. 
0 0 6 
SOLUTION To find the eigenvalues, we compute det (A — X/): 
1-.X 2 3 
det(A—\)=}| 0 4-A 5 


0 0 6—A 
= (1— A)(4— A)(6— A) 


Since our matrix is triangular, the determinant is easy to compute; it is just the 
product of the diagonal elements. Therefore, we found (and factored) our character- 
istic polynomial very easily, and we see that we have eigenvalues of \ = 1, 4, and 6. 


This examples demonstrates a wonderful fact for us: the eigenvalues of a triangular 
matrix are simply the entries on the diagonal. Finding the corresponding eigenvectors 
still takes some work, but finding the eigenvalues is easy. 

With that fact in the backs of our minds, let us proceed to the next example where 
we will come across some more interesting facts about eigenvalues and eigenvectors. 


—7 -—2 10 
| and letB = |—3 2 3 | (as used in 
-6 -—2 9 


—3 15 
3 9 


Example 91 LetA = | 
Examples 86 and 88, respectively). Find the following: 
1. eigenvalues and eigenvectors of A and B 
2. eigenvalues and eigenvectors of A~? and B-? 
3. eigenvalues and eigenvectors of A’ and B’ 


4. The trace of Aand B 


5. The determinant of A and B 


SOLUTION We'll answer each in turn. 
1. We already know the answer to these for we did this work in previous examples. 
Therefore we just list the answers. 


For A, we have eigenvalues \ = —6 and 12, with eigenvectors 


= 1 : 
X=X) | 1 | and x2 H , respectively. 


For B, we have eigenvalues \ = —1, 2, and 3 with eigenvectors 
3 2 1 
X=x3]1],x3]1] and x3 | 0] , respectively. 
2 2 1 
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2. We first compute the inverses of A and B. They are: 


-4 1/3 13/3 
_, [-1/8 5/24 i 
A ai | ee a = te Ae 


Finding the eigenvalues and eigenvectors of these matrices is not terribly hard, 
but it is not “easy,” either. Therefore, we omit showing the intermediate steps 
and go right to the conclusions. 


For A~*, we have eigenvalues \ = —1/6 and 1/12, with eigenvectors 


= 1 : 
X =X) 1 and x2 1 , respectively. 


For B~+, we have eigenvalues \ = —1, 1/2 and 1/3 with eigenvectors 
3 2 1 
X=x3]1]|,x3]1] and x3 | 0] , respectively. 
2 2 1 


3. Of course, computing the transpose of A and Bis easy; computing their eigenval- 
ues and eigenvectors takes more work. Again, we omit the intermediate steps. 


For A’, we have eigenvalues \ = —6 and 12 with eigenvectors 


= 5 : 
X=X) 1 and x2 al? respectively. 


For B’, we have eigenvalues \ = —1, 2 and 3 with eigenvectors 
—1 —1 0 
X=x3| 0 |,x3] 1 and x3 | —2 | , respectively. 
1 1 1 


4. The trace of A is 6; the trace of B is 4. 


5. The determinant of A is —72; the determinant of B is —6. 


Now that we have completed the “grunt work,” let’s analyze the results of the pre- 
vious example. We are looking for any patterns or relationships that we can find. 


The eigenvalues and eigenvectors of A and A~?. 


In our example, we found that the eigenvalues of A are —6 and 12; the eigenvalues 
of A~? are —1/6 and 1/12. Also, the eigenvalues of B are —1, 2 and 3, whereas the 
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eigenvalues of B~* are —1, 1/2 and 1/3. There is an obvious relationship here; it 
seems that if A is an eigenvalue of A, then 1/, will be an eigenvalue of A~+. We can 
also note that the corresponding eigenvectors matched, too. 

Why is this the case? Consider an invertible matrix A with eigenvalue \ and eigen- 
vector X. Then, by definition, we know that AX = AX. Now multiply both sides by 
AT}: 


Ax = \x 
AtAK = AT1AK 
x=” Xx 
1, > 
ete — A~1x 


We have just shown that A~?X = 1/)x; this, by definition, shows that x is an eigen- 
vector of A~? with eigenvalue 1/.. This explains the result we saw above. 


The eigenvalues and eigenvectors of A and A’. 


Our example showed that A and A’ had the same eigenvalues but different (but 
somehow similar) eigenvectors; it also showed that B and B’ had the same eigenvalues 
but unrelated eigenvectors. Why is this? 

We can answer the eigenvalue question relatively easily; it follows from the prop- 
erties of the determinant and the transpose. Recall the following two facts: 


1. (A+ B)' =A‘ +B" (Theorem 11) and 
2. det (A) = det (A") (Theorem 16). 


We find the eigenvalues of a matrix by computing the characteristic polynomial; 
that is, we find det (A — A/). What is the characteristic polynomial of A’? Consider: 


det (AT — Al) = det (A’ — I’) since / = /" 
= det ((A — \/)’) Theorem 11 
= det (A — Al) Theorem 16 


So we see that the characteristic polynomial of A’ is the same as that for A. There- 
fore they have the same eigenvalues. 

What about their respective eigenvectors? Is there any relationship? The simple 
answer is “No.”14 


11We have defined an eigenvector to be a column vector. Some mathematicians prefer to use row vectors 
instead; in that case, the typical eigenvalue/eigenvector equation looks like XA = AX. It turns out that doing 
things this way will give you the same eigenvalues as our method. What is more, take the transpose of the 
above equation: you get (xA)’ = (Ax)! which is also AX’ = Ax’. The transpose of a row vector is a 
column vector, so this equation is actually the kind we are used to, and we can say that x” is an eigenvector 
of Al. 

In short, what we find is that the eigenvectors of A’ are the “row” eigenvectors of A, and vice-versa. 


4.2 Properties of Eigenvalues and Eigenvectors 


The eigenvalues and eigenvectors of A and The Trace. 


Note that the eigenvalues of A are —6 and 12, and the trace is 6; the eigenvalues 
of B are —1, 2 and 3, and the trace of B is 4. Do we notice any relationship? 


It seems that the sum of the eigenvalues is the trace! Why is this the case? 


The answer to this is a bit out of the scope of this text; we can justify part of this 
fact, and another part we’ll just state as being true without justification. 


First, recall from Theorem 13 that tr(AB) =tr(BA). Secondly, we state without jus- 
tification that given a square matrix A, we can find a square matrix P such that P~ AP is 
an upper triangular matrix with the eigenvalues of A on the diagonal.’? Thus tr(P” *AP) 
is the sum of the eigenvalues; also, using our Theorem 13, we know that tr(P-*AP) = 
tr(P-*PA) =tr(A). Thus the trace of A is the sum of the eigenvalues. 


The eigenvalues and eigenvectors of A and The Determinant. 


Again, the eigenvalues of A are —6 and 12, and the determinant of A is —72. The 
eigenvalues of B are —1, 2 and 3; the determinant of B is —6. It seems as though the 
product of the eigenvalues is the determinant. 


This is indeed true; we defend this with our argument from above. We know that 
the determinant of a triangular matrix is the product of the diagonal elements. There- 
fore, given a matrix A, we can find P such that P~!AP is upper triangular with the 
eigenvalues of A on the diagonal. Thus det (P~*AP) is the product of the eigenvalues. 
Using Theorem 16, we know that det (P~*AP) = det (P~*PA) = det (A). Thus the 
determinant of A is the product of the eigenvalues. 


We summarize the results of our example with the following theorem. 


Who in the world thinks up this stuff? It seems that the answer is Marie Ennemond Camille Jordan, 
who, despite having at least two girl names, was a guy. 
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Theorem 19 Properties of Eigenvalues and Eigenvectors 


Let A be ann x ninvertible matrix. The following are true: 


1. If A is triangular, then the diagonal elements of A are 
the eigenvalues of A. 


. If Ais an eigenvalue of A with eigenvector x, then + is 
an eigenvalue of A~* with eigenvector x. 


. If X is an eigenvalue of A then X is an eigenvalue of 
A 


. The sum of the eigenvalues of A is equal to tr(A), the 
trace of A. 


. The product of the eigenvalues of A is the equal to 
det (A), the determinant of A. 


There is one more concept concerning eigenvalues and eigenvectors that we will 
explore. We do so in the context of an example. 


Example 92 Find the eigenvalues and eigenvectors of the matrixA = | : : | : 
SOLUTION To find the eigenvalues, we compute det (A — XI): 
1-A 2 
det (A — a1) =| 1 | 
= (1—A)(2— A) -2 
= —3) 
= A(A — 3) 


Our eigenvalues are therefore \ = 0, 3. 
For A = 0, we find the eigenvectors: 


12 0) Fe |1 2 0 
1 2 0 0 0 0 


This shows that x; = —2x>, and so our eigenvectors x are 


xX =x 2 
a nn 
For \ = 3, we find the eigenvectors: 
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—2 2 0; Fe |1 1 0 
1 -1 0 0 oO O 


This shows that x, = x2, and so our eigenvectors x are 
x=x : 
=%|4|- 


One interesting thing about the above example is that we see that 0 is an eigen- 
value of A; we have not officially encountered this before. Does this mean anything 
significant? 

Think about what an eigenvalue of 0 means: there exists an nonzero vector x where 
AX = 0X = O. That is, we have a nontrivial solution to AX = 0. We know this only 
happens when A is not invertible. 

So if A is invertible, there is no nontrivial solution to AX = 0, and hence 0 is not 
an eigenvalue of A. If A is not invertible, then there is a nontrivial solution to AX = 0, 
and hence 0 js an eigenvalue of A. This leads us to our final addition to the Invertible 
Matrix Theorem. 


Theorem 20 Invertible Matrix Theorem 


Let A be annxn matrix. The following statements are equiv- 
alent. 


(a) A is invertible. 


(h) A does not have an eigenvalue of 0. 


This section is about the properties of eigenvalues and eigenvectors. Of course, we 
have not investigated all of the numerous properties of eigenvalues and eigenvectors; 
we have just surveyed some of the most common (and most important) concepts. 
Here are four quick examples of the many things that still exist to be explored. 

First, recall the matrix 


that we used in Example 84. It’s characteristic polynomial is p(A) = Sah 5. 
Compute p(A); that is, compute A? — 4A —5]. You should get something “interesting,” 
and you should wonder “does this always work?”4 


3since 0 is a “special” number, we might think so — afterall, we found that having a determinant of 0 is 
important. Then again, a matrix with a trace of 0 isn’t all that important. (Well, as far as we have seen; it 
actually is). So, having an eigenvalue of 0 may or may not be significant, but we would be doing well if we 
recognized the possibility of significance and decided to investigate further. 

Yes. 
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Second, in all of our examples, we have considered matrices where eigenvalues 
“appeared only once.” Since we know that the eigenvalues of a triangular matrix ap- 
pear on the diagonal, we know that the eigenvalues of 


are “1 and 1;” that is, the eigenvalue \ = 1 appears twice. What does that mean when 
we consider the eigenvectors of \ = 1? Compare the result of this to the matrix 


which also has the eigenvalue \ = 1 appearing twice. 
Third, consider the matrix 
0 -1 
eel 


What are the eigenvalues?!© We quickly compute the characteristic polynomial to be 
p(\) = \* + 1. Therefore the eigenvalues are +,/—1 = +i. What does this mean? 

Finally, we have found the eigenvalues of matrices by finding the roots of the char- 
acteristic polynomial. We have limited our examples to quadratic and cubic polynomi- 
als; one would expect for larger sized matrices that a computer would be used to factor 
the characteristic polynomials. However, in general, this is not how the eigenvalues 
are found. Factoring high order polynomials is too unreliable, even with a computer 
— round off errors can cause unpredictable results. Also, to even compute the charac- 
teristic polynomial, one needs to compute the determinant, which is also expensive 
(as discussed in the previous chapter). 

So how are eigenvalues found? There are iterative processes that can progressively 
transform a matrix A into another matrix that is a/most an upper triangular matrix (the 
entries below the diagonal are almost zero) where the entries on the diagonal are the 
eigenvalues. The more iterations one performs, the better the approximation is. 

These methods are so fast and reliable that some computer programs convert poly- 
nomial root finding problems into eigenvalue problems! 

Most textbooks on Linear Algebra will provide direction on exploring the above 
topics and give further insight to what is going on. We have mentioned all the eigen- 
value and eigenvector properties in this section for the same reasons we gave in the 
previous section. First, knowing these properties helps us solve numerous real world 
problems, and second, it is fascinating to see how rich and deep the theory of matrices 
is. 


15To direct further study, it helps to know that mathematicians refer to this as the duplicity of an eigen- 
value. In each of these two examples, A has the eigenvalue \ = 1 with duplicity of 2. 
16Be careful; this matrix is not triangular. 
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Exercises 4.2 


In Exercises 1-6, a matrix A is given. For 2. —2 || 
each, | —1 3 
(a) Find the eigenvalues of A, and for each 3 [5 30 
eigenvalue, find an eigenvector. “|-1 —6 
(b) Do the same for A’. , [a4 || 
(c) Do the same for A~?. “[-1 13 
(d) Find tr(A). 5 -9 0 
. 5 1 -5 0 
(e) Find det (A). 5 a 8 
Use Theorem 19 to verify your results. 
0 25 O 
1 0 4 6. | 1 O 0 
“)-1 5 1 1 -3 
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We already looked at the basics of graphing vectors. In this chapter, we’ll explore these 
ideas more fully. One often gains a better understanding of a concept by “seeing” it. 
For instance, one can study the function f(x) = x? and describe many properties of 
how the output relates to the input without producing a graph, but the graph can 
quickly bring meaning and insight to equations and formulae. Not only that, but the 
study of graphs of functions is in itself a wonderful mathematical world, worthy of 
exploration. 

We’ve studied the graphing of vectors; in this chapter we’ll take this a step further 
and study some fantastic graphical properties of vectors and matrix arithmetic. We 
mentioned earlier that these concepts form the basis of computer graphics; in this 
chapter, we’ll see even better how that is true. 


5.1 Transformations of the Cartesian Plane 


AS YOU READ 


1. To understand how the Cartesian plane is affected by multiplication by a matrix, 
it helps to study how what is affected? 


2. Transforming the Cartesian plane through matrix multiplication transforms straight 
lines into what kind of lines? 


3. T/F: If one draws a picture of a sheep on the Cartesian plane, then transformed 
the plane using the matrix 
-1 0 
Oo 1)’ 


one could say that the sheep was “sheared.” 
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We studied in Section 2.3 how to visualize vectors and how certain matrix arith- 
metic operations can be graphically represented. We limited our visual understanding 
of matrix multiplication to graphing a vector, multiplying it by a matrix, then graphing 
the resulting vector. In this section we'll explore these multiplication ideas in greater 
depth. Instead of multiplying individual vectors by a matrix A, we’ll study what hap- 
pens when we multiply every vector in the Cartesian plans by A.? 


Because of the Distributive Property as we saw demonstrated way back in Example 
41, we can say that the Cartesian plane will be transformed in a very nice, predictable 
way. Straight lines will be transformed into other straight lines (and they won’t become 
curvy, or jagged, or broken). Curved lines will be transformed into other curved lines 
(perhaps the curve will become “straight,” but it won’t become jagged or broken). 

One way of studying how the whole Cartesian plane is affected by multiplication by 
a matrix A is to study how the unit square is affected. The unit square is the square with 
corners at the points (0,0), (1,0), (1,1), and (0,1). Each corner can be represented 
by the vector that points to it; multiply each of these vectors by A and we can get an 
idea of how A affects the whole Cartesian plane. 

Let’s try an example. 


Example 93 Plot the vectors of the unit square before and after they have been 
multiplied by A, where 


SOLUTION The four corners of the unit square can be represented by the vec- 


lo] Lol] 


Multiplying each by A gives the vectors 


respectively. 
(Hint: one way of using your calculator to do this for you quickly is to make a2 x 4 
matrix whose columns are each of these vectors. In this case, create a matrix 


1No, we won’t do them one by one. 


5.1 Transformations of the Cartesian Plane 


This saves time, especially if you do a similar procedure for multiple matrices A. Of 
course, we can save more time by skipping the first column; since it is the column of 
zeros, it will stay the column of zeros after multiplication by A.) 

The unit square and its transformation are graphed in Figure 5.1, where the shaped 
vertices correspond to each other across the two graphs. Note how the square got 
turned into some sort of quadrilateral (it’s actually a parallelogram). A really interest- 
ing thing is how the triangular and square vertices seem to have changed places — it is 
as though the square, in addition to being stretched out of shape, was flipped. 


y y 


Figure 5.1: Transforming the unit square by matrix multiplication in Example 93. 


> 


Figure 5.2: Emphasizing straight lines going to straight lines in Example 93. 


To stress how “straight lines get transformed to straight lines,” consider Figure 5.2. 
Here, the unit square has some additional points drawn on it which correspond to the 
shaded dots on the transformed parallelogram. Note how relative distances are also 
preserved; the dot halfway between the black and square dots is transformed to a po- 
sition along the line, halfway between the black and square dots. 


Much more can be said about this example. Before we delve into this, though, let’s 
try one more example. 
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Example 94 Plot the transformed unit square after it has been transformed by 
A, where 
Oo -1 
A= : 
SOLUTION We'll put the vectors that correspond to each corner in a matrix B 


as before and then multiply it on the left by A. Doing so gives: 
Oo -1//0 110 
ae= lt o][o oa 1 
_{0 0 -1 -1 
~10 1 #21 0 
In Figure 5.3 the unit square is again drawn along with its transformation by A. 


y y 


Figure 5.3: Transforming the unit square by matrix multiplication in Example 94. 


Make note of how the square moved. It did not simply “slide” to the left;? nor 
did it “flip” across the y axis. Rather, it was rotated counterclockwise about the origin 
90°. In a rotation, the shape of an object does not change; in our example, the square 
remained a square of the same size. 


We have broached the topic of how the Cartesian plane can be transformed via 
multiplication by a2 x 2 matrix A. We have seen two examples so far, and our intuition 
as to how the plane is changed has been informed only by seeing how the unit square 
changes. Let’s explore this further by investigating two questions: 


1. Suppose we want to transform the Cartesian plane in a known way (for instance, 
we may want to rotate the plane counterclockwise 180°). How do we find the 
matrix (if one even exists) which performs this transformation? 


2. How does knowing how the unit square is transformed really help in understand- 
ing how the entire plane is transformed? 


These questions are closely related, and as we answer one, we will help answer 
the other. 


2mathematically, that is called a translation 
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To get started with the first question, look back at Examples 93 and 94 and con- 
sider again how the unit square was transformed. In particular, is there any correlation 
between where the vertices ended up and the matrix A? 

If you are just reading on, and haven’t actually gone back and looked at the exam- 
ples, go back now and try to make some sort of connection. Otherwise — you may have 
noted some of the following things: 


1. The zero vector (0, the “black” corner) never moved. That makes sense, though; 
AO = 0. 


: is always 
ol’ y' 


2. The “square” corner, i.e., the corner corresponding to the vector | 


transformed to the vector in the first column of A! 


3. Likewise, the “triangular” corner, i.e., the corner corresponding to the vector 


O;. . 
1 , is always transformed to the vector in the second column of A!? 


4. The “white dot” corner is always transformed to the sum of the two column 
vectors of A.4 


Let’s now take the time to understand these four points. The first point should be 
clear; Owill always be transformed to Ovia matrix multiplication. (Hence the hint in the 
middle of Example 93, where we are told that we can ignore entering in the column of 
zeros in the matrix B.) 

We can understand the second and third points simultaneously. Let 


a b eee fe + |0 
a=|? Ar =| and é=|t). 


What are Ae, and Ae? 


3 Although this is less of a surprise, given the result of the previous point. 
4This observation is a bit more obscure than the first three. It follows from the fact that this corner of 
the unit square is the “sum” of the other two nonzero corners. 
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So by mere mechanics of matrix multiplication, the square corner e; is transformed 
to the first column of A, and the triangular corner e3 is transformed to the second col- 
umn of A. A similar argument demonstrates why the white dot corner is transformed 
to the sum of the columns of A.° 


Revisit now the question “How do we find the matrix that performs a given trans- 
formation on the Cartesian plane?” The answer follows from what we just did. Think 
about the given transformation and how it would transform the corners of the unit 
square. Make the first column of A the vector where e; goes, and make the second 
column of A the vector where é3 goes. 


Let’s practice this in the context of an example. 


Example 95 Find the matrix A that flips the Cartesian plane about the x axis and 
then stretches the plane horizontally by a factor of two. 


. . - 1 : 
SOLUTION We first consider e; = A . Where does this corner go to under 
the given transformation? Flipping the plane across the x axis does not change é€; at 


all; stretching the plane sends e; to El . Therefore, the first column of A is H ‘ 
: ~ 0 ee . ~ 
Now consider e) = 1\: Flipping the plane about the x axis sends e> to the vec- 
tor | 5 | ; subsequently stretching the plane horizontally does not affect this vector. 


Therefore the second column of A is | : 


Putting this together gives 


To help visualize this, consider Figure 5.4 where a shape is transformed under this 
matrix. Notice how it is turned upside down and is stretched horizontally by a factor 
of two. (The gridlines are given as a visual aid.) 


5 Another way of looking at all of this is to consider what A - / is: of course, it is just A. What are the 
columns of /? Just é; and é3. 


5.1 Transformations of the Cartesian Plane 


' ' 


Figure 5.4: Transforming the Cartesian plane in Example 95 


Awhile ago we asked two questions. The first was “How do we find the matrix that 
performs a given transformation?” We have just answered that question (although we 
will do more to explore it in the future). The second question was “How does knowing 
how the unit square is transformed really help us understand how the entire plane is 
transformed?” 

Consider Figure 5.5 where the unit square (with vertices marked with shapes as 
before) is shown transformed under an unknown matrix. How does this help us un- 
derstand how the whole Cartesian plane is transformed? For instance, how can we 
use this picture to figure out how the point (2, 3) will be transformed? 


y 


A 


Figure 5.5: The unit square under an unknown transformation. 


There are two ways to consider the solution to this question. First, we know now 
how to compute the transformation matrix; the new position of e; is the first column 
of A, and the new position of e3 is the second column of A. Therefore, by looking at 
the figure, we can deduce that 

1 -1 
A= - 


SAt least, A is close to that. The square corner could actually be at the point (1.01, .99). 
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To find where the point (2, 3) is sent, simply multiply 


1 -1])f2]_ [-1 
1 2 3; | 8 I° 
There is another way of doing this which isn’t as computational — it doesn’t involve 
computing the transformation matrix. Consider the following equalities: 


B}=[o)+[3 


This last equality states something that is somewhat obvious: to arrive at the vector 
2 eh ee —_ oe tine asd : 
| 3 | , one needs to go 2 units in the e; direction and 3 units in the e3 direction. To find 


where the point (2, 3) is transformed, one needs to go 2 units in the new e; direction 
and 3 units in the new eé) direction. This is demonstrated in Figure 5.6. 


y 
new location of n 
the point (2, 3) 


\ 
3 x “new” e5 
\ 
\ 
% 
% 
\ 
\ 


77 
“2x “new” e; 
7 

¢ 

<+—_+—_ +> X 


y 


Figure 5.6: Finding the new location of the point (2, 3). 


We are coming to grips with how matrix transformations work. We asked two ba- 
sic questions: “How do we find the matrix for a given transformation?” and “How 
do we understand the transformation without the matrix?”, and we’ve answered each 
accompanied by one example. Let’s do another example that demonstrates both tech- 
niques at once. 


Example 96 First, find the matrix A that transforms the Cartesian plane by stretch- 
ing it vertically by a factor of 1.5, then stretches it horizontally by a factor of 0.5, then 
rotates it clockwise about the origin 90°. Secondly, using the new locations of e; and 
€>, find the transformed location of the point (—1, 2). 
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SOLUTION To find A, first consider the new location of €;. Stretching the plane 


vertically does not affect e); stretching the plane horizontally by a factor of 0.5 changes 
- 1/2 oa 
e;to ; | This is the 
first column of A. 


Now consider the new location of €5. Stretching the plane vertically changes it to 


, and then rotating it 90° about the origin moves it to | a 


) 3/2 
i | ; stretching horizontally does not affect it, and rotating 90° moves it to | ; | ‘ 


This is then the second column of A. This gives 


Where does the point (—1,2) get sent to? The corresponding vector | is 


found by going —1 units in the e; direction and 2 units in the e5 direction. Therefore, 
the transformation will send the vector to —1 units in the new e direction and 2 units 
in the new é direction. This is sketched in Figure 5.7, along with the transformed unit 
square. We can also check this multiplicatively: 


[te T][2'] = Ln] 


Figure 5.8 shows the effects of the transformation on another shape. 


Figure 5.7: Understanding the transformation in Example 96. 


vy 


Figure 5.8: Transforming the Cartesian plane in Example 96 195 
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Right now we are focusing on transforming the Cartesian plane — we are making 2D 
transformations. Knowing how to do this provides a foundation for transforming 3D 
space,’ which, among other things, is very important when producing 3D computer 
graphics. Basic shapes can be drawn and then rotated, stretched, and/or moved to 
other regions of space. This also allows for things like “moving the camera view.” 


What kinds of transformations are possible? We have already seen some of the 
things that are possible: rotations, stretches, and flips. We have also mentioned some 
things that are not possible. For instance, we stated that straight lines always get trans- 
formed to straight lines. Therefore, we cannot transform the unit square into a circle 
using a matrix. 


Let’s look at some common transformations of the Cartesian plane and the matri- 
ces that perform these operations. In the following figures, a transformation matrix 
will be given alongside a picture of the transformed unit square. (The original unit 
square is drawn lightly as well to serve as a reference.) 


2D Matrix Transformations 


y 
Horizontal stretch by a 
factor of k. (k, 1) 
k O 
0 1 
x 
y 
Vertical stretch by a (1, k) 
factor of k. 
1 0 
Ook 
~ { > X 


7 Actually, it provides a foundation for doing it in 4D, 5D, .. ., 17D, etc. Those are just harder to visualize. 


5.1 Transformations of the Cartesian Plane 


Horizontal shear by a | 
factor of k. 


fo 3] Z 


Vertical shear by a factor 


of k. 
1 0 
k 1 


Horizontal reflection 
across the y axis. 


ols 30 - =x 
fo 4 


Vertical reflection across 
the x axis. 


1 O os 
0 -1 
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Diagonal reflection 
across the line y = x. 


[ro] , 


Rotation around the a 
origin by an angle of @. 


cos@ —sin@ 
sind  cos@ 


Projection onto the x y 
axis. + 
(Note how the square is 
“squashed” down onto :.- = 

the x-axis.) x 


fo 9] | 


Projection onto the y y 

axis. 

(Note how the square is 

“squashed” over onto 

the y-axis.) < > xX 


fo 5] 


Now that we have seen a healthy list of transformations that we can perform on 
the Cartesian plane, let’s practice a few more times creating the matrix that gives the 
desired transformation. In the following example, we develop our understanding one 
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more critical step. 


Example 97 Find the matrix A that transforms the Cartesian plane by perform- 
ing the following operations in order: 


1. Vertical shear by a factor of 3. Horizontal stretch by a fac- 


0.5 tor of 2 
2. Counterclockwise rotation 4. Diagonal reflection across 
about the origin by an angle the line y = x 
of 6 = 30° 
SOLUTION Wow! We already know how to do this — sort of. We know we 


can find the columns of A by tracing where e; and e> end up, but this also seems 
difficult. There is so much that is going on. Fortunately, we can accomplish what we 
need without much difficulty by being systematic. 

First, let’s perform the vertical shear. The matrix that performs this is 


1 0 
aia E ih 


After that, we want to rotate everything clockwise by 30°. To do this, we use 


cos 30° —sin30° /3/2 -1/2 
AN aa ne , 
sin30° cos 30 1/2. 3/2 


In order to do both of these operations, in order, we multiply A.A;.° 
To perform the final two operations, we note that 


2 O 0 1 
a= | | and ae=|2 | 


perform the horizontal stretch and diagonal reflection, respectively. Thus to perform 
all of the operations “at once,” we need to multiply by 


A= AjA3A2A, 
“(2 ae) [o s|[c2 rales 3] 
- Rae _ | 


2/3-1)/2 =1 
_ [0.933 0.866 
™ 11.232 -1 


8The reader might ask, “Is it important to do multiply these in that order? Could we have multiplied 
A1A2 instead?” Our answer starts with “Is matrix multiplication commutative?” The answer to our question 
is “No,” so the answers to the reader’s questions are “Yes” and “No,” respectively. 
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Let’s consider this closely. Suppose | want to know where a vector x ends up. We 
claim we can find the answer by multiplying Ax. Why does this work? Consider: 


AX = AgA3AnAiX 


= AygA3A2(Aix) (performs the vertical shear) 

= AyA3(A2x1) (performs the rotation) 

= Aq(A3X) (performs the horizontal stretch) 
= A,X; (performs the diagonal reflection) 
=X, (the result of transforming x) 


Most readers are not able to visualize exactly what the given list of operations does 
to the Cartesian plane. In Figure 5.9 we sketch the transformed unit square; in Figure 
5.10 we sketch a shape and its transformation. 


Figure 5.9: The transformed unit square in Example 97. 


y 


Figure 5.10: A transformed shape in Example 97. 


Once we know what matrices perform the basic transformations,” performing com- 
plex transformations on the Cartesian plane really isn’t that ... complex. It boils down 
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to multiplying by a series of matrices. 


5.1 Transformations of the Cartesian Plane 


We’ve shown many examples of transformations that we can do, and we’ve men- 
tioned just a few that we can’t — for instance, we can’t turn a square into a circle. Why 
not? Why is it that straight lines get sent to straight lines? We spent a lot of time within 
this text looking at invertible matrices; what connections, if any,'° are there between 
invertible matrices and their transformations on the Cartesian plane? 

All these questions require us to think like mathematicians — we are being asked to 
study the properties of an object we just learned about and their connections to things 
we’ve already learned. We’ll do all this (and more!) in the following section. 


Exercises 5.1 


In Exercises 1-4, asketch of transformed unit In Exercises 5 — 10, a list of transformations is 
square is given. Find the matrix A that per- given. Find the matrix A that performs those 


forms this transformation. 


transformations, in order, on the Cartesian 


y plane. 
4 5. (a) vertical shear by a factor of 2 
(b) horizontal shear by a factor of 2 
6. (a) horizontal shear by a factor of 2 
(b) vertical shear by a factor of 2 
7. (a) horizontal stretch by a factor of 3 
(b) reflection across the line y = x 
8. (a) counterclockwise rotation by an 
angle of 45° 
(b) vertical stretch by a factor of 1/2 
y 
A 9. (a) clockwise rotation by an angle of 
90° 
(b) horizontal reflection across the y 
2. axis 
1 a (c) vertical shear by a factor of 1 
Yy 
10. (a) vertical reflection across the x 
4 axis 
(b) horizontal reflection across the y 
axis 
3. 1 (c) diagonal reflection across the line 
x y=x 
if 1 
In Exercises 11 — 14, two sets of transforma- 
y tions are given. Sketch the transformed unit 
4. 1 # square under each set of transformations. 
A Be ctie x Are the transformations the same? Explain 
1 why/why not. 


10By now, the reader should expect connections to exist. 
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11. (a) a horizontal reflection across the 13. (a) ahorizontal stretch by a factor of 
y axis, followed by a vertical re- 1/2 followed by a vertical stretch 
flection across the x axis, com- by a factor of 3, compared to 
pared to (b) the same operations but in oppo- 

(b) a counterclockise rotation of site order 
180° 14. (a) a reflection across the line y = 


x followed by a reflection across 


12. (a) ahorizontal stretch by a factor of the x axis, compared to 


2 followed by a reflection across 


the line y = x, compared to (b) a reflection across the the y axis, 


followed by a reflection across 
(b) a vertical stretch by a factor of 2 the line y = x. 


5.2 Properties of Linear Transformations 


AS YOU READ 


1. T/F: Translating the Cartesian plane 2 units up is a linear transformation. 


2. T/F: If Tis a linear transformation, then T(0) = 0. 


In the previous section we discussed standard transformations of the Cartesian 
plane — rotations, reflections, etc. As a motivational example for this section’s study, 
let’s consider another transformation — let’s find the matrix that moves the unit square 
one unit to the right (see Figure 5.11). This is called a translation. 


y y 
AL x anet x 
Figure 5.11: Translating the unit square one unit to the right. 
Our work from the previous section allows us to find the matrix quickly. By looking 


: ap ae 2 = 1 
at the picture, it is easy to see that e; is moved to 0 and e> is moved to 1|° 


Therefore, the transformation matrix should be 


However, look at Figure 5.12 where the unit square is drawn after being trans- 
formed by A. It is clear that we did not get the desired result; the unit square was not 
translated, but rather stretched/sheared in some way. 


5.2 Properties of Linear Transformations 


Figure 5.12: Actual transformation of the unit square by matrix A. 


What did we do wrong? We will answer this question, but first we need to develop 
a few thoughts and vocabulary terms. 

We’ve been using the term “transformation” to describe how we’ve changed vec- 
tors. In fact, “transformation” is synonymous to “function.” We are used to functions 
like f(x) = x’, where the input is a number and the output is another number. In the 
previous section, we learned about transformations (functions) where the input was 
a vector and the output was another vector. If A is a “transformation matrix,” then we 
could create a function of the form T(x) = Ax. That is, a vector x is the input, and the 
output is X multiplied by A.1" 

When we defined f(x) = x? above, we let the reader assume that the input was 
indeed a number. If we wanted to be complete, we should have stated 


f:R—R_ where f(x) =x’. 


The first part of that line told us that the input was a real number (that was the first 
R) and the output was also a real number (the second R). 

To define a transformation where a 2D vector is transformed into another 2D vector 
via multiplication by a 2 x 2 matrix A, we should write 


T:R*— R? where T(x) = Ax. 
Here, the first R* means that we are using 2D vectors as our input, and the second R? 


means that a 2D vector is the output. 
Consider a quick example: 


2 
xy 
T:R* 5 R? where i= 2x1 


Notice that this takes 2D vectors as input and returns 3D vectors as output. For in- 


stance, 
9 
a= 
—6 


We now define a special type of transformation (function). 


11We used T instead of f to define this function to help differentiate it from “regular” functions. “Nor- 
mally” functions are defined using lower case letters when the input is a number; when the input is a vector, 
we use upper case letters. 
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Definition 29 Linear Transformation 


A transformation T : IR” — R"” is a linear transformation if 
it satisfies the following two properties: 


T(x + y) = T(X) + T(V) for all vectors xX and y, and 


2. T(kx) = kT(x) for all vectors x and all scalars k. 


If Tis a linear transformation, it is often said that “Tis linear.” 


Let’s learn about this definition through some examples. 


Example 98 Determine whether or not the transformation T : R? + R?isa 
linear transformation, where 


SOLUTION We'll arbitrarily pick two vectors xX and y: 


[3] = [3] 


Let’s check to see if Tis linear by using the definition. 


1. Is T(x + ¥) = T(x) + T(y)? First, compute x 


ale :) [] 


Now compute 7(x), T(y), and T(x + y): 


r@=7(| 2,]) rw =T(|5]) na+9=T(|31) 


9 1 16 
—6 5 12 


9 1] , [16 
6/+ /2|/4]8 
—6 5 12 
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Therefore, Tis not a linear transformation. 
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So we have an example of something that doesn’t work. Let’s try an example where 
things do work.’2 


Example 99 Determine whether or not the transformation T : R* — Risa 
linear transformation, where T(x) = Ax and 
1 2 
A= E ;| | 
SOLUTION Let’s start by again considering arbitrary x and y. Let’s choose the 


same x and y from Example 98. 


[2] = [i 


If the lineararity properties hold for these vectors, then maybe it is actually linear (and 
we’ll do more work). 


1. Is T(X +) = T(x) + T(y)? Recall: 


7) =1(| 2, ]) 7) =T(|5]) na+9=T(|31) 


lI 
a | 
pa, sl 

ray 
SS | 

II 
| a | 
NP 
wr 
a | 

lI 
| | | 
Ne 
fo 
———aael 


Is T(X +¥) = T(x) + T(V)? 


So far, so good: T(x + y) is equal to T(x) + T(y). 


12 Recall a principle of logic: to show that something doesn’t work, we just need to show one case where 
it fails, which we did in Example 98. To show that something always works, we need to show it works 
for all cases — simply showing it works for a few cases isn’t enough. However, doing so can be helpful in 
understanding the situation better. 
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2. Is T(kx) = kT(X)? Let’s arbitrarily pick k = 7, and use x as before. 


roo=1( Zl) 
72 


=7-T(x) | 


So far it seems that T is indeed linear, for it worked in one example with arbitrarily 
chosen vectors and scalar. Now we need to try to show it is always true. 
Consider T(x + y). By the definition of T, we have 


T(x +y) =A(X+4+y). 
By Theorem 3, part 2 (on page 62) we state that the Distributive Property holds for 


matrix multiplication.? So A(x + y) = AX + Ay. Recognize now that this last part is 
just T(x) + T(y)! We repeat the above steps, all together: 


T(x + y) = A(X + ¥) (by the definition of T in this example) 
= AX + Ay (by the Distributive Property) 
= T(X) + T(Y) (again, by the definition of 7) 


Therefore, no matter what xX and y are chosen, T(x + y) = T(x) + T(y). Thus the first 
part of the lineararity definition is satisfied. 
The second part is satisfied in a similar fashion. Let k be a scalar, and consider: 


T(kx) = A(kx) (by the definition of Tis this example) 
= kAX (by Theorem 3 part 3) 
= kT(x) (again, by the definition of T) 


Since T satisfies both parts of the definition, we conclude that T is a linear trans- 
formation. 


We have seen two examples of transformations so far, one which was not linear and 
one that was. One might wonder “Why is linearity important?”, which we’ll address 
shortly. 

First, consider how we proved the transformation in Example 99 was linear. We 
defined T by matrix multiplication, that is, T(x) = Ax. We proved T was linear using 
properties of matrix multiplication — we never considered the specific values of A! That 
is, we didn’t just choose a good matrix for T; any matrix A would have worked. This 


13 Recall that a vector is just a special type of matrix, so this theorem applies to matrix-vector multiplica- 
tion as well. 


5.2 Properties of Linear Transformations 


leads us to an important theorem. The first part we have essentially just proved; the 
second part we won’t prove, although its truth is very powerful. 


Theorem 21 Matrices and Linear Transformations 


1. Define T : R’ > R” by T(x) = Ax, where A is an 
m X n matrix. Then T is a linear transformation. 


2. Let T: R” > R" be any linear transformation. Then 
there exists an unique m xn matrix A such that T(x) = 
AX. 


The second part of the theorem says that al/ linear transformations can be de- 
scribed using matrix multiplication. Given any linear transformation, there is a matrix 
that completely defines that transformation. This important matrix gets its own name. 


Definition 30 Standard Matrix of a Linear Transformation 


Let T: R” — R” bea linear transformation. By Theorem 
21, there is a matrix A such that T(x) = Ax. This matrix A 


is called the standard matrix of the linear transformation T, 
and is denoted [ T]. 


°The matrix—like brackets around T suggest that the standard matrix A 
is a matrix “with T inside.” 


While exploring all of the ramifications of Theorem 21 is outside the scope of this 
text, let it suffice to say that since 1) linear transformations are very, very important 
in economics, science, engineering and mathematics, and 2) the theory of matrices is 
well developed and easy to implement by hand and on computers, then 3) it is great 
news that these two concepts go hand in hand. 

We have already used the second part of this theorem in a small way. In the pre- 
vious section we looked at transformations graphically and found the matrices that 
produced them. At the time, we didn’t realize that these transformations were linear, 
but indeed they were. 

This brings us back to the motivating example with which we started this section. 
We tried to find the matrix that translated the unit square one unit to the right. Our 
attempt failed, and we have yet to determine why. Given our link between matrices 
and linear transformations, the answer is likely “the translation transformation is not 
a linear transformation.” While that is a true statement, it doesn’t really explain things 
all that well. Is there some way we could have recognized that this transformation 
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wasn’t linear?“ 
Yes, there is. Consider the second part of the linear transformation definition. It 


states that T(kx) = kT(x) for all scalars k. If we let k = 0, we have T(Ox) = 0- T(x), or 
more simply, T(0) = 0. That is, if Tis to be a linear transformation, it must send the 


zero vector to the zero vector. 
This is a quick way to see that the translation transformation fails to be linear. By 


shifting the unit square to the right one unit, the corner at the point (0,0) was sent to 
the point (1, 0), i-e., 


: was sent to the vector : 
0 O|- 


the vector | 


This property relating to Ois important, so we highlight it here. 


Key Idea 15 Linear Transformations and 0 


Let T: R” — R” bea linear transformation. Then: 


= 


T(6,) = Om. 


That is, the zero vector in IR” gets sent to the zero vector in 
R™. 


The interested reader may wish to read the footnote below.?° 
The Standard Matrix of a Linear Transformation 
It is often the case that while one can describe a linear transformation, one doesn’t 


know what matrix performs that transformation (i.e., one doesn’t know the standard 
matrix of that linear transformation). How do we systematically find it? We’ll need a 


new definition. 


Definition 31 Standard Unit Vectors 


In IR", the standard unit vectors é; are the vectors with a 1 
in the i” entry and Os everywhere else. 


M4 That is, apart from applying the definition directly? 

15 The idea that linear transformations “send zero to zero” has an interesting relation to terminology. The 
reader is likely familiar with functions like f(x) = 2x + 3 and would likely refer to this as a “linear function.” 
However, f(0) 4 0, so fis not “linear” by our new definition of linear. We erroneously call f “linear” since 
its graph produces a line, though we should be careful to instead state that “the graph of fis a line.” 
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We've already seen these vectors in the previous section. In R?, we identified 


é-|5| and a=|t|. 


aa 
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How do these vectors help us find the standard matrix of a linear transformation? 
Recall again our work in the previous section. There, we practiced looking at the trans- 
formed unit square and deducing the standard transformation matrix A. We did this 
by making the first column of A the vector where e; ended up and making the second 
column of A the vector where é ended up. One could represent this with: 


A=([T(éi) T(é)] =[TI. 


That is, T(€1) is the vector where e; ends up, and T(é3) is the vector where é) ends up. 

The same holds true in general. Given a linear transformation T : R” > R™”, the 
standard matrix of T is the matrix whose i** column is the vector where é; ends up. 
While we won’t prove this is true, it is, and it is very useful. Therefore we'll state it 
again as a theorem. 


Theorem 22 The Standard Matrix of a Linear Transformation 


Let T: R” > R” bea linear transformation. Then [T] is the 


m X n matrix: 


Let’s practice this theorem in an example. 


Example 100 Define T : IR? —> R* to be the linear transformation where 
X, +X 
% 1 2 
3x1 — X3 
T X2 = 
2x2 + 5X3 
x3 


4x, + 3x2 + 2x3 
Find [ T]. 
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SOLUTION T takes vectors from R? into R’, so [ T] is going to be a 4 x 3 matrix. 
Note that 
1 0 0 
ep=|]0], & =] 1 and é€;=1|0 
0 0 1 
We find the columns of [| T] by finding where e;, e) and e3 are sent, that is, we find 


T(é1), T(e2) and T(e3). 


1 () 0 
T(é1) =T 0 T(é)) =T 1 T(é3) =T 0 
0 0 1 
1 1 0 
_ 13 _ {0 _ |-1 
~ 10 ae |e: ~ | 5 
4 3 2 
Thus 
11 0 
3 0 -1 
Tl=a= 02 5 
43 2 
Let’s check this. Consider the vector 
1 
x= |2 
3 


Strictly from the original definition, we can compute that 


‘ £3 3 
7 a .i8 0 
T(x) =T : 4415 | | 19 
44646 16 


Now compute T(x) by computing [ T|x= Ax. 


11 0)7, 3 
, |3 0 -1 0 
Nig oa. i ey 39 
Ae 2 16 


They match!*® 


Let’s do another example, one that is more application oriented. 


16 Of course they do. That was the whole point. 
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Example 101 A baseball team manager has collected basic data concerning his 
hitters. He has the number of singles, doubles, triples, and home runs they have hit 
over the past year. For each player, he wants two more pieces of information: the total 
number of hits and the total number of bases. 


Using the techniques developed in this section, devise a method for the manager 
to accomplish his goal. 


SOLUTION If the manager only wants to compute this for a few players, then 
he could do it by hand fairly easily. After all: 


total # hits = # of singles + # of doubles + # of triples + # of home runs, 


and 


total # bases = # of singles + 2x# of doubles + 3x# of triples + 4x# of home runs. 


However, if he has a lot of players to do this for, he would likely want a way to 
automate the work. One way of approaching the problem starts with recognizing that 
he wants to input four numbers into a function (i.e., the number of singles, doubles, 
etc.) and he wants two numbers as output (i.e., number of hits and bases). Thus he 
wants a transformation T : R* —> IR? where each vector in R4 can be interpreted as 


# of singles 
# of doubles 
# of triples 
# of home runs 


and each vector in IR? can be interpreted as 
# of hits 
# of bases | ° 


To find | T], he computes T(e€;), T(€2), T(é3) and T(éq). 
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: Ane + 1 
(What do these calculations mean? For example, finding T(e3) = | | means that 


3 
one triple counts as 1 hit and 3 bases.) 


Thus our transformation matrix [ Tis 


iiana=[} 33 3] 


As an example, consider a player who had 102 singles, 30 doubles, 8 triples and 14 
home runs. By using A, we find that 


meaning the player had 154 hits and 242 total bases. 


A question that we should ask concerning the previous example is “How do we 
know that the function the manager used was actually a linear transformation? After 
all, we were wrong before — the translation example at the beginning of this section 
had us fooled at first.” 

This is a good point; the answer is fairly easy. Recall from Example 98 the transfor- 
mation 
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and from Example 100 


. x1 + XQ 
r. e - 3x1 — x3 
100 x2 = 2x5 + 5x3 ’ 
x3 


4x, + 3X2 + 2X3 


where we use the subscripts for T to remind us which example they came from. 

We found that Tsg was not a linear transformation, but stated that Ty99 was (al- 
though we didn’t prove this). What made the difference? 

Look at the entries of T9g(X) and Ti90(X). Tog contains entries where a variable is 
squared and where 2 variables are multiplied together — these prevent Tog from being 
linear. On the other hand, the entries of T1909 are all of the form ax; +--+ ++ a)Xx,; that 
is, they are just sums of the variables multiplied by coefficients. Tis linear if and only if 
the entries of T(x) are of this form. (Hence linear transformations are related to linear 
equations, as defined in Section 1.1.) This idea is important. 


Key Idea 16 Conditions on Linear Transformations 


Let T : R” — R"” bea transformation and consider the 


entries of 
X1 


X2 
T(x) =T 


Xn 


Tis linear if and only if each entry of T(x) is of the form a x,+ 
2X2 +:++QnXp. 


Going back to our baseball example, the manager could have defined his transfor- 
mation as 


X41 

T X2 = X1 + X2 + X3 + Xa 
X3 X, + 2x2 + 3x3 + 4x, 
X4 


Since that fits the model shown in Key Idea 16, the transformation T is indeed linear 
and hence we can find a matrix [ T] that represents it. 
Let’s practice this concept further in an example. 


Example 102 Using Key Idea 16, determine whether or not each of the following 
transformations is linear. 


n(ls)=[%e"] (La) [Ve] 
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a(a) [7m * 
X2 1X2 

SOLUTION T, is not linear! This may come as a surprise, but we are not al- 
lowed to add constants to the variables. By thinking about this, we can see that this 
transformation is trying to accomplish the translation that got us started in this section 
— it adds 1 to all the x values and leaves the y values alone, shifting everything to the 
right one unit. However, this is not linear; again, notice how 0 does not get mapped 
to 0. 

T is also not linear. We cannot divide variables, nor can we put variabless in- 
side the square root function (among other other things; again, see Section 1.1). This 
means that the baseball manager would not be able to use matrices to compute a 
batting average, which is (number of hits)/(number of at bats). 

Tz is linear. Recall that \/7 and 7 are just numbers, just coefficients. 


We’ve mentioned before that we can draw vectors other than 2D vectors, although 
the more dimensions one adds, the harder it gets to understand. In the next section 
we'll learn about graphing vectors in 3D — that is, how to draw on paper or a computer 
screen a 3D vector. 


Exercises 5.2 


In Exercises 1 — 5, a transformation T is given. 
Determine whether or not T is linear; if not, 7.7 
state why not. 


wre) = [a2] 
| X2 | 3x1 — X2 
ie Paige 8. T 
-r([n = [at 
| X2 | X1 — X2 
[x [xr +1 
3. r( al | X1 X1 + 3x3 
: 7 9. T 


Xi + 2X2 = 3x3 
0 

Xi + 4x3 
5x2 + x3 


X2 = 


X2 => X1 — X3 
é r( =) = | 10. T 
| X2 | 0 


X3 X1 + X3 
In Exercises 6 — 11, a linear transformation T 
is given. Find | T]. 


oral = [ate] 
| X2 | X1 — X2 


= [ x1 + 2x2 + 3x3 + xz | 
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5.3 Visualizing Vectors: Vectors in Three Dimensions 


AS YOU READ 


1. T/F: The viewpoint of the reader makes a difference in how vectors in 3D look. 


2. T/F: If two vectors are not near each other, then they will not appear to be near 
each other when graphed. 


3. T/F: The parallelogram law only applies to adding vectors in 2D. 


We ended the last section by stating we could extend the ideas of drawing 2D vec- 
tors to drawing 3D vectors. Once we understand how to properly draw these vectors, 
addition and subtraction is relatively easy. We’ll also discuss how to find the length of 
a vector in 3D. 

We start with the basics of drawing a vector in 3D. Instead of having just the tradi- 
tional x and y axes, we now add a third axis, the z axis. Without any additional vectors, 
a generic 3D coordinate system can be seen in Figure 5.13. 


Z 


ae 


x 


Figure 5.13: The 3D coordinate system 


In 2D, the point (2, 1) refers to going 2 units in the x direction followed by 1 unit in 
the y direction. In 3D, each point is referenced by 3 coordinates. The point (4, 2, 3) is 
found by going 4 units in the x direction, 2 units in the y direction, and 3 units in the z 
direction. 

How does one sketch a vector on this coordinate system? As one might expect, we 

1 
can sketch the vector V = | 2 | by drawing an arrow from the origin (the point (0,0,0)) 
3 
to the point (1, 2, a). The only “tricky” part comes from the fact that we are trying 
to represent three dimensional space on a two dimensional sheet of paper. However, 


17Of course, we don’t have to start at the origin; all that really matters is that the tip of the arrow is 1 unit 
in the x direction, 2 units in the y direction, and 3 units in the z direction from the origin of the arrow. 
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it isn’t really hard. We’ll discover a good way of approaching this in the context of an 
example. 


Example 103 Sketch the following vectors with their origin at the origin. 
2 1 
v=]1 and u=|] 3 
3 -1 
SOLUTION We'll start with v first. Starting at the origin, move 2 units in the x 


direction. This puts us at the point (2,0,0) on the x axis. Then, move 1 unit in the y 
direction. (In our method of drawing, this means moving 1 unit directly to the right. 
Of course, we don’t have a grid to follow, so we have to make a good approximation of 
this distance.) Finally, we move 3 units in the z direction. (Again, in our drawing, this 
means going straight “up” 3 units, and we must use our best judgment in a sketch to 
measure this.) 

This allows us to locate the point (2, 1, 3); now we draw an arrow from the origin 
to this point. In Figure 5.14 we have all 4 stages of this sketch. The dashed lines show 
us moving down the x axis in (a); in (b) we move over in the y direction; in (c) we move 
up in the z direction, and finally in (d) the arrow is drawn. 


Z Z 


(d) 


Figure 5.14: Stages of sketching the vector V for Example 103. 


5.3 Visualizing Vectors: Vectors in Three Dimensions 


Drawing the dashed lines help us find our way in our representation of three di- 
mensional space. Without them, it is hard to see how far in each direction the vector 
is supposed to have gone. 

To draw U, we follow the same procedure we used to draw V. We first locate the 
point (1,3, —1), then draw the appropriate arrow. In Figure 5.15 we have u drawn 
along with v. We have used different dashed and dotted lines for each vector to help 
distinguish them. 

Notice that this time we had to go in the negative z direction; this just means we 
moved down one unit instead of up a unit. 


Figure 5.15: Vectors V and u in Example 103. 


As in 2D, we don’t usually draw the zero vector, 


It doesn’t point anywhere. It is a conceptually important vector that does not have a 
terribly interesting visualization. 

Our method of drawing 3D objects on a flat surface — a 2D surface — is pretty clever. 
It isn’t perfect, though; visually, drawing vectors with negative components (especially 
negative x coordinates) can look a bit odd. Also, two very different vectors can point 
to the same place. We’ll highlight this with our next two examples. 


—3 
Example 104 Sketch the vector V = | —1 
2 
SOLUTION We use the same procedure we used in Example 103. Starting at 


the origin, we move in the negative x direction 3 units, then 1 unit in the negative y 
direction, and then finally up 2 units in the z direction to find the point (—3, —1, 2). 
We follow by drawing an arrow. Our sketch is found in Figure 5.16; Vis drawn in two 
coordinate systems, once with the helpful dashed lines, and once without. The second 
drawing makes it pretty clear that the dashed lines truly are helpful. 
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> 
Figure 5.16: Vector Vv in Example 104. 
2 —2 
Example 105 Draw the vectorsV¥V = | 4| and&@= | 1 | onthe same coordi- 
2 -1 
nate system. 
SOLUTION We follow the steps we’ve taken before to sketch these vectors, 


shown in Figure 5.17. The dashed lines are aides for V and the dotted lines are aids 
for U. We again include the vectors without the dashed and dotted lines; but without 
these, it is very difficult to tell which vector is which! 


> 
> 


Figure 5.17: Vectors V and U in Example 105. 


Our three examples have demonstrated that we have a pretty clever, albeit imper- 
fect, method for drawing 3D vectors. The vectors in Example 105 look similar because 
of our viewpoint. In Figure 5.18 (a), we have rotated the coordinate axes, giving the 
vectors a different appearance. (Vector V now looks like it lies on the y axis.) 

Another important factor in how things look is the scale we use for the x, y, and 
z axes. In 2D, it is easy to make the scale uniform for both axes; in 3D, it can be a bit 
tricky to make the scale the same on the axes that are “slanted.” Figure 5.18 (b) again 
shows the same 2 vectors found in Example 105, but this time the scale of the x axis 


5.3 Visualizing Vectors: Vectors in Three Dimensions 


is a bit different. The end result is that again the vectors appear a bit different than 
they did before. These facts do not necessarily pose a big problem; we must merely 
be aware of these facts and not make judgments about 3D objects based on one 2D 
: 18 

image. 


Figure 5.18: Vectors V and d in Example 105 with 
a different viewpoint (a) and x axis scale (b). 


We now investigate properties of vector arithmetic: what happens (i.e., how do 
we draw) when we add 3D vectors and multiply by a scalar? How do we compute the 
length of a 3D vector? 


Vector Addition and Subtraction 


In 2D, we saw that we could add vectors together graphically using the Parallelo- 
gram Law. Does the same apply for adding vectors in 3D? We investigate in an example. 


2 1 
Example 106 LetV= |1] andu= | 3 |. SketchV+u. 
3 -1 
SOLUTION We sketched each of these vectors previously in Example 103. We 
3 
sketch them, along withv+u = | 4 |, in Figure 5.19 (a). (We use loosely dashed lines 
2 


for V+ U.) 


18The human brain uses both eyes to convey 3D, or depth, information. With one eye closed (or missing), 
we can have a very hard time with “depth perception.” Two objects that are far apart can seem very close 
together. A simple example of this problem is this: close one eye, and place your index finger about a foot 
above this text, directly above this WORD. See if you were correct by dropping your finger straight down. 
Did you actually hit the proper spot? Try it again with both eyes, and you should see a noticable difference 
in your accuracy. 

Looking at 3D objects on paper is a bit like viewing the world with one eye closed. 
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Does the Parallelogram Law still hold? In Figure 5.19 (b), we draw additional rep- 
resentations of Vv and U to form a parallelogram (without all the dotted lines), which 
seems to affirm the fact that the Parallelogram Law does indeed hold. 


(b) 


Figure 5.19: Vectors Vv, U, and V + U Example 106. 


We also learned that in 2D, we could subtract vectors by drawing a vector from the 
tip of one vector to the other.’9 Does this also work in 3D? We'll investigate again with 
an example, using the familiar vectors V and U from before. 


2 di 
Example 107 LetvV= |1] andud= | 3 |. SketchV—J. 
3 —1 
1 
SOLUTION It is simple to compute that V— G@ = | —2 |. All three of these 
4 


vectors are sketched in Figure 5.20 (a), where again Vis guided by the dashed, u by the 
dotted, and v — d by the loosely dashed lines. 


Does the 2D subtraction rule still hold? That is, can we draw V — u by drawing an 
arrow from the tip of u to the tip of V? In Figure 5.20 (b), we translate the drawing of 
V—Uto the tip of U, and sure enough, it looks like it works. (And in fact, it really does.) 


19Recall that it is important which vector we used for the origin and which was used for the tip. 
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(b) 


Figure 5.20: Vectors v, U, and V — U from Example 107. 


The previous two examples highlight the fact that even in 3D, we can sketch vec- 
tors without explicitly knowing what they are. We practice this one more time in the 
following example. 


Example 108 Vectors Vand U are drawn in Figure 5.21. Using this drawing, sketch 
the vectors V+ U and V — u. 


Sl 


Figure 5.21: Vectors V and u for Example 108. 


SOLUTION Using the Parallelogram Law, we draw v + U by first drawing a gray 
version of U coming from the tip of V; V+ Wis drawn dashed in Figure 5.22. 


To draw V — U, we draw a dotted arrow from the tip of Uto the tip of V. 
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Figure 5.22: Vectors V, U, V + U and Vv — Ud for Example 108. 


Scalar Multiplication 


Given a vector V in 3D, what does the vector 2V look like? How about —v? After 
learning about vector addition and subtraction in 3D, we are probably gaining confi- 
dence in working in 3D and are tempted to say that 2V is a vector twice as long as V, 
pointing in the same direction, and —V is a vector of the same length as V, pointing 
in the opposite direction. We would be right. We demonstrate this in the following 


example. 
Example 109 Sketch V, 2V, and —V, where 
1 
V=|2 
3 
SOLUTION 


x 


Figure 5.23: Sketching scalar multiples of Vv in Example 109. 
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It is easy to compute 


2 —1 
2v= | 4 and —VvV=|]-2 
6 —3 


These are drawn in Figure 5.23. This figure is, in many ways, a mess, with all the 
dashed and dotted lines. They are useful though. Use them to see how each vec- 
tor was formed, and note that 2V at least looks twice as long as V, and it looks like —V 
points in the opposite direction.”° 


Vector Length 


How do we measure the length of a vector in 3D? In 2D, we were able to answer this 
question by using the Pythagorean Theorem. Does the Pythagorean Theorem apply in 
3D? Ina sense, it does. 

1 
Consider the vector V = | 2 |, as drawn in Figure 5.24 (a), with guiding dashed 
3 
lines. Now look at part (b) of the same figure. Note how two lengths of the dashed 
lines have now been drawn gray, and another dotted line has been added. 


(b) 


Figure 5.24: Computing the length of Vv 


These gray dashed and dotted lines form a right triangle with the dotted line form- 
ing the hypotenuse. We can find the length of the dotted line using the Pythagorean 
Theorem. 


length of the dotted line = /sum of the squares of the dashed line lengths 


That is, the length of the dotted line = \/17 + 2? = 5. 


20Qur previous work showed that looks can be deceiving, but it is indeed true in this case. 
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Now consider this: the vector V is the hypotenuse of another right triangle: the 
one formed by the dotted line and the vertical dashed line. Again, we employ the 
Pythagorean Theorem to find its length. 


length of V = length of dashed gray line)? + (length of black dashed line)? 


Thus, the length of Vis (recall, we denote the length of Vv with ||vj|): 


||v|| = (length of gray line)* + (length of black line)? 


= /5 +3? 
= V5+37 


Let’s stop for a moment and think: where did this 5 come from in the previous 
equation? It came from finding the length of the gray dashed line —it came from 17-+2. 
Let’s substitute that into the previous equation: 


\|¥]|] = V54+37 
— 4 /42 +22 4 32 
=vV14 


The key comes from the middle equation: ||v|| = \/1? + 27 + 3%. Do those num- 
bers 1, 2, and 3 look familiar? They are the component values of v! This is very similar 
to the definition of the length of a 2D vector. After formally defining this, we’ll practice 
with an example. 


Definition 32 3D Vector Length 


Let 
X41 
X2 
X3 


The /ength of Vv, denoted ||v/|, is 


[V\] = xp +23 + x3. 
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Example 110 Find the lengths of vectors V and U, where 
2 -4 
v= |-3 and u=| 7 
5 0 
SOLUTION We apply Definition 32 to each vector: 
||¥]| = 4/2? + (-3)2 + 8? 
= /44+9+425 
[||| = / (4)? + 7? + 07 
= V16 + 49 
= V65 


Here we end our investigation into the world of graphing vectors. Extensions into 
graphing 4D vectors and beyond can be done, but they truly are confusing and not 
really done except for abstract purposes. 

There are further things to explore, though. Just as in 2D, we can transform 3D 
space by matrix multiplication. Doing this properly — rotating, stretching, shearing, 
etc. — allows one to manipulate 3D space and create incredible computer graphics. 


Exercises 5.3 


In Exercises 1 — 4, vectors X and y are given. In Exercises 5 — 8, vectors X and y are drawn. 
Sketch x, y,X-+y, andx—yonthesameCarte- Sketch 2x, —y, x + y, and xX — y on the same 


sian axes. Cartesian axes. 

1 2 

1.X¥=]-1],y=1]3 
2 2 + 

x 

2 —1 

2.x= 4 |,y=]|-3 Z 
—1 —1 
1 3 

3.xX= E y= 5. 
2 6 y y 
0 0 

4.¥=]1],y=]-1 
1 1 Xx 
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aaa 


In Exercises 9 — 12, a vector X and a scalar 
a are given. Using Definition 32, compute 
the lengths of x and ax, then compare these 
lengths. 


10..¥=] 4 |,a=-1 
3 


“ML 


1 
12. X=} 2 |,a=3 
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SOLUTIONS TO SELECTED PROBLEMS 


2 
5 


Chapter 1 
Section 1.1 
1. y 
3. y 
5. n 
7. y 
9. y 
11. x=1,y=-2 
13. x= —-1,y=0,z=2 
15. 29 chickens and 33 pigs 
Section 1.2 
3 4 5 
1; -1 1 —3 
2 —2 3 
1 3 -4 
3. —1 0 4 
2 3 4 
5 X1+2x7= 3 
: —xX, + 3x2 = 9 
7 X1 + X2 — X3 — Xa 
: 2x1 + X2 + 3x3 + 5X4 
9 X1 + x3 + 7x5 = 
. X2 + 3x3 + 2x4 = 
2 -1 #7 
11, 5 0 3 
0 4 —2 


17 


2 -1 7 
13. 0 4 —2 
5 8 —1 
2 -1 7 
15. 0 4 —2 
0 5/2 29/2 
17. Ri + R2 — R2 
19. Ry + R2 
21. x=2,y=1 
23. x= —1,y=0 
25. x1 = —2,x2 = 1, x3 =2 
Section 1.3 
1. (a) yes 
(b) no 
3. (a) no 
(b) yes 
f1 0 
= | 0 | 
fl 3 
ys | 0 | 
1 0 
= | 0 1 
fa: 2 2 
2 0 3 | 
1 O 
13. 0 1 
0 OO 


(c) no 


(d) yes 


(c) yes 
(d) yes 


Chapter A Solutions To Selected Problems 


1 0 0O 
15. Oo 1 0 
Oo oO 1 
1 0 0 1 
17. Or <£- Te vt 
0 0 0 0 
1 O 1 3 
19. Oo 1 -2 4 
0 =O 0 0 
1 1 0 0 0 0 
ae 0 013 1 4 
Section 1.4 
1. x1 = 1 — 2x; x2 is free. Possible solutions: 
XxX, = 1,x2 =Oand x, 1° = 1, 
3. xX, =1;x. =2 
5. No solution; the system is inconsistent. 
7. X1 = —11 + 10x3; x2 = —4 + 4x3; x3 is 
free. Possible solutions: x; = —11, 
X2 4, x3 = Oand x, 1, x2 = Oand 
x3 = 1. 
9. xX, = 1— x2 — Xa; X2 is free; x3 = 1 — 2x4; 
Xq is free. Possible solutions: x; = 1, 
X2 = 0,x3 = 1,xq4 = Oandx, 2, 
X2 = 1, x3 = —3,xX, = 2 
11. No solution; the system is inconsistent. 
13..%, = ; _ 4x3; x2 = ; _ 3X35 X3 is free. 
Possible solutions: x1 = $2 = £3 =0 
and x, = —1,x2 =0,x3=1 
15. Never exactly 1 solution; infinite solutions if 
k = 2; no solution if k 4 2. 
17. Exactly 1 solution if k # 2; no solution if 
k = 2; never infinite solutions. 
Section 1.5 
1. 29 chickens and 33 pigs 
3. 42 grande tables, 22 venti tables 
5. 30 men, 15 women, 20 kids 
7. f(x) = —2x + 10 
9. f(x) = 5x2 +3x4+1 
11. f(x) =3 
13. f(x) =x2 41 
15. f(x) = 3x+1 
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17. 


19. 


21. 


23. 


The augmented matrix from this system is 
1) 7 1 1 8 
6 1 2 3 24 
o 1 -1 0 0 
find the solution 


. From this we 


8 1 
ne 
3 3 
8 1 
x=s- = 
3 3 
8 4, 
w= —— -f. 
3 3 


The only time each of these variables are 
nonnegative integers is when f = 2 or 

f = 8. If f = 2, then we have 2 
touchdowns, 2 extra points and 2 two point 
conversions (and 2 field goals); this doesn’t 
make sense since the extra points and two 
point conversions follow touchdowns. If 

f = 8, then we have no touchdowns, extra 
points or two point conversions (just 8 field 
goals). This is the only solution; all points 
were scored from field goals. 


Let x1, x2 and x3 represent the number of 
free throws, 2 point and 3 point shots 
taken. The augmented matrix from this 

1 1 30 
1 2 3 80 
find the solution 


system is . From this we 


X1 = —20+ x3 
X2 = 50 — 2x3. 


In order for x1 and x2 to be nonnegative, we 
need 20 < x3 < 25. Thus there are 6 
different scenerios: the “first” is where 20 
three point shots are taken, no free throws, 
and 10 two point shots; the “last” is where 
25 three point shots are taken, 5 free 
throws, and no two point shots. 


Let y = ax + b; all linear functions through 
(1,3) come in the form y = (3 — b)x +b. 
Examples: b = O yields y = 3x; b = 2 
yields y = x + 2. 


Let y = ax* + bx + c; we find that 
a=—i+icandb=i-— 3c. Examples: 
c= 1yields y = —x + 1;c = 3 yields 
y=x?—-4x+3. 


Chapter 2 


Section 2.1 


9 —7 
2 la | 
—14 
Ale 
—15 
[os 
—5 9 
ar as ee eal 
—5 —2 
a are a 
15. a=2,b=1 
17. a=5/2+43/2b 
19. No solution. 
21. No solution. 
Section 2.2 
1. —22 
3. 0 
5:5. 
7. 15 
9. —2 
11. Not possible. 
8 3 
13. AB= ee cal 
—3 24 
wal? 4] 
—1 -—2 12 
acl | 10 4 | 
BA is not possible. 
17. AB is not possible. 
27 —33 39 
o0e: ee —3 ore 
—32 34 —24 
19. AB= | —32 38 —8 
—16 21 4 
22 —14 
ao e ai 
—56 2 —36 
21. AB= 20 19 —30 
—50 —-—13 0 
—46 40 
com | 72. «9 


23. 


15 22 21 1 
16 —53 -59 -—31 
BA is not possible. 


25: 


27. 


29. 


St. 


33. 


35. 


37. 


39. 


41. 


43. 


45. 


0 0 4 
AB=|6 4 -2 
2 -4 -6 
2 -2 6 
BA= | 2 2 4 
4 0 —6 
21 —-17 —-5 
AB = } 19 5 19 
5 9 4 
19 5 23 
BA = 5 —7 -1 
—14 6 18 
4 -6 
a= [t =] 
4 8 
AD = lies ae 
2 2 2 
DA = 6 6 6 
—15 -15 —15 
2 -3 5 
AD=| 4 -6 10 
-—6 9 15 
dia dib dic 
DA = dod dze dof 
d3g d3h d3i 
dia d2b d3c 
AD=}|d;d doe df 
dig d2h d3i 
~ —6 
Ax = | 11 
—5 
AX = 5 
21 
X1 + 2x2 + 3x3 
Ax = x1 + 2x3 
2x1 + 3x2 + x3 
4 0 8 0 
pee “es 
m E 54 =k | 
o oOo 1 1 0 0O 
AA=]1 0 O|;44=]0 1 O 
Oo 1 0 0 oO 1 
0 —2 
@ [4] 
10 2 
(b) i a 
(c) —11 —-—15 
37 32 


(A+B) (A+B) = AA+AB+BA+BB = 
A? + AB + BA + B2. 
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Section 2.3 


ao fees oe 3 
vere} (4 


Sketches will vary depending on choice of 
origin of each vector. 


X+y 
y 9. ||x|| = V5; ||ax]| = 45 = 3V5. The 
xy vector ax is 3 times as long as x. 
x 11. ||x]| = V34; ||ax]| = 34. The vectors ax 
5 and xX are the same length (they just point 
in opposite directions). 
13. (a) {|x| = V2; |v] = V13; 
5 pasa a ae a see dla 
MN ae an ee (b) [|x|] = V5; [I] = 35; 
X+yi\| =4Vv5. 
Sketches will vary depending on choice of 5 inhi =o 
origin of each cor : (c) ad = 10; [I] = v29; 
X+yl|| = V65. 
: (d) |e] = V5; lll = 2V5; 
1 ae Xx+y\| = V5. 
gamed The equality holds sometimes; only when x 
and y point along the same line, in the same 
SKS direction. 
X+y 
Be EA y 
y Ay rN 
5. Sketches will vary depending on choice of 
origin of each vector. 
X+y a 
y = 
Ax 
x 
x 
15. 
of oy 
Yo, 
x 
x 
; fi 
17. ay 
7. Sketches will vary depending on choice of 
origin of each vector. Section 2.4 
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SOP SGU QS 


11. 


13. 


15. 


17. 


19. 


21. 


Multiply Ad and Av to verify. 
Multiply Ad and Av to verify. 
Multiply Ad and Av to verify. 


Multiply Ad, AV and A(iu + V) to verify. 


Multiply Ad, AV and A(u + V) to verify. 


w( 
. | 2/5 
1b) EAA 
we 
, [ -2 
he 4 | 
5/4 
(a) X=x3] 1 
1 
1 5/4 
(b) x= | 0] 4x3] 1 
0 1 
14 
(a) ¥ = x3 | —10 
) 
14 
wo =| }|+* —10 
0 
2 -1 
(a) F=H3 | 779 | 44 | 27° 
) 1 
(b) ¥ = 
+ X4 


1 
(a) K = x2 0 + x4 
0 
0 
13/2 
0 
XS —2 
0 
1 
26 = 
bor! . !') 
(b) X= | 3/2 | +x2 
0 
0 


23. (a) K=xq 


x4 


1/2 13/2 
0 0 
—1/2| +x5 | -2 
1 0 
0 1 
1 0 
13/9 -1 
—1/3 | +x5 | —1 
1 0 
0 1 


< 


> X 


9 X= 


11. X= 


Section 2.6 


1-9 
-4 5 
—2 -7 
7 -6 
-5 2 -3 
-4 -3 -2 
0 

3-1 

3 -3 3 
2-2 -3 
33-21 22 
5/3 2/3 1 
-1/3 1/6 0 
1/3 1/3 0 
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[—24 —5 
A 5 | 
[1/3 0 
3. 
0 41/7 
5. A—! does not exist. 
f1 0 
a | 0 | 
—5/13 3/13 
9. 
| 1/13 2/13 
—2 1 
Ye mA 
1 2 —2 
13. 0 1 —3 
6 10 —5 
1 0 0 
15. 52 -48 7 
8 —7 1 
17. A~1 does not exist 
25 8 0 
19. 78 25 O 
—30 -9 1 
Oo 1 0 
21. Oo oOo 1 
1 O 0O 
1, 0 0 0 
-3 -1 0 -4 
a3: —35 -10 1 —47 
—2 —2 0 -9 
28 18 3 —19 
5 1 0 —5 
sa 4 5 1 0 
52 60 12 —15 
[° 0 1 | 
0 0 O0O 1 
ads 1 0 0 0 
o 1 0 0 
= 2 
29. X= BI 
= —8 
31. X= | 1 
—7 
33. X= 1 
-1 
3 
35. x=] -1 
-—9 
Section 2.7 
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(AB)~* = | ue 2 | 
wor=[ 26, 2] 
At= | 7 = 


. Solutions will vary. 


11. 


Likely some entries that should be 0 will not 
be exactly 0, but rather very small values. 


Chapter 3 


Section 3.1 


1. 


Ais symmetric. | mm 


3. Ais diagonal, as is A’. | | 


5. 


11. 


13. 


15. 


17. 


19. 


-5 3 —-10 
-9 1 -8 
4 -9 


7 6 
-4 3 
-~9 -9 

7 

8 


—9 6 


—8 
4 —3 1 
10 —7 -1 


4 Oo -2 
Ais symmetric. 0 2 3 
2 3 6 
2 5 7 
—- 5 -4 
—3 -6 —-10 
4 5 —6 
2 —4 6 
-—9 -—10 9 
Ais upper triangular; A’ is lower triangular. 
—3 0) 0 
—4 -3 0 
—5 5 -3 


21. Aisdiagonal,asisA’.] 0 2 O 


0 oO -1 
-1 2 
23. Ais skew symmetric. 1 0 —4 
—2 4 #0 
Section 3.2 
1. 6 
35-33 
5. —9 
Ye 
9. Not defined; the matrix must be square. 
11. —23 
13. 4 
15. 0 
17. (a) tr(A)=8; tr(B)=—2; tr(A + B)=6 
(b) tr(AB) = 53 = tr(BA) 
19. (a) tr(A)=—1; tr(B)=6; tr(A + B)=5 
(b) tr(AB) = 201 = tr(BA) 
Section 3.3 
1. 34 
3. —44 
5. —44 
7. 28 
9. (a) The submatrices are 2 ‘ 
6 10 
3 6 3.7 47 
|; fo [rand /; Al 
respectively. 
(b) C12 = 34, C12 = —24,Ci3 = 11 
11. (a) The submatrices are E eal 
[ss ral and | 3 | 
-9 9)’ -9 3)’ 
respectively. 
(b) C12 3, Cir = —63, C1,3 = 18 
13. —59 
15. 15 
17. 3 
19. 0 
21. 0 
23. —113 


25. Hint: Ci.1 = d. 
Section 3.4 
1. 84 
3. 0 
5. 10 
7. 24 
9. 175 
11. —200 
13. 34 
15. (a) det(A) = 41; Ro + R3 
(b) det(B) = 164; —4R3 —> Rs 
(c) det(C) = —41; Rp + Ri > Ri 
17. (a) det(A) = —16; Ri +> Ro then 
Ri © R3 
(b) det(B) = —16; —Ri — Ri and 
—R> > Ry 
(c) det(C) = —432;C =3%*M 
19. det(A) = 4, det(B) = 4, det(AB) = 16 


21. det(A) = —12, det(B) = 29, 
det(AB) = —348 
23. —59 
25. 15 
27. 3 
29. 0 
Section 3.5 
1. (a) det (A) = 14, det (A1) = 70, 
det (A2) = 14 
> 5 
a 
3. (a) det (A) = 0, det (Ai) = 0, 
det Az) =0 
(b) Infinite solutions exist. 
5. (a) det (A) = 16, det (A1) = —64, 
det (Az) = 80 
“4 —4 
we-(3] 
7. (a) det (A) = —123, det (Ai) = —492, 
det (A2) = 123, det (A3) = 492 
4 
(b) X= | -1 
—4 
9. (a) det (A) = 56, det (Ai) = 224, 


det (A7) = 0, det (A3) = —112 
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4 
(b) x= | O 
—2 
11. (a) det (A) = 0, det (Ai) = 147, 
det (Ar) = —49, det (A3) = —49 
(b) No solution exists. 
Chapter 4 
Section 4.1 
1.A=3 
3. =0 
BASS 
ey pt 
ela 
3 
9.x=|-7 
7 
-1 
11. X= 1 
1 
ois cad 9 
13. A1 = 4withx; = ek 
2 = Swithx = H 
So oes —2 
15. Ai = —3 withx; = | | 
A2 = 5 withx> = H 
Page: 1 
17. A1 = 2 withx; = EE 
aio -1 
da = 4 with = | 
fe 1 
19. Ai = —1 withx; = EF 
A2 = —3 withx, = ‘ 
2 2=]69 
—3 
21. A. =3withx; = | O |; 
2 
—5 
A2 = 4withx, = | —-1 
1 
1 
3 = 5withx3 = | 0 
0 


24 
23. Ay = —Swithx; = | 13 |; 
8 
6 
dA2 = —2 withxo = | 5 
1 
0 
A3 = 3withx3 = | 1 
0 
0 
25. \y = —2withx; = | 0]; 
1 
0 
d2 = 1withxs = | 3 
5 
28 
A3 = 5withx3 = | 7 
1 
1 
27. 4) =—2withx= | 0]; 
1 
1 
A2 = 3with¥ = | 1]; 
1 
0 
A3 = 5withX = | 1 
1 
Section 4.2 
aie [sd 
1. (a) A1 = 1withx; = ft 
Gan Be: 
A2 = 4withx2 = | 
ee amen fe 
(b) Ar = 1withx; = 1 | 
A2 = 4with x; = ie 
2= 2= | 4 
ee 1 
(c) Ar = 1/4 with xy = Bi 
A2 = 4with x = al 
(d) 5 : 
(e) 4 
3. 


Gr ie we ae 
2 = Owith xo = EF 
(b) dy = —2 with i = | 
1 

5 


A2 = Owith x = | 


(c) Ais not invertible. 


5. 


(d) -1 
(e) O 


—7 
(a) Ar = —4 with x7 = E : 
0 
A2 = 3withx, = | O 
1 
9 
1 


A3 = 4with x3 = 


N 


2 
-1 
(b) Ar = —4 with xj = 9 |; 
A2 = 3 withx = 


A3 = 4with x3 = 


| 
=< —— Ch 
aes 
XS 


A2 = 1/3 with x = 


—7 
(c) Ar = —1/4 with xj = EF : 
0 
0 
1 
9 
A3 = 1/4 with x3 1 


(d) 3 
(e) —48 


Chapter 5 


Section 5.1 


11. 


fo -1 
A= L -—1 =| 
Yes, these are the same; the transformation 
-1 0 


matrix in each is 
0 -—1 


13. Yes, these are the same. Each produces the 
; : es | 
transformation matrix ; 
0) 3 
Section 5.2 
1. Yes 
3. No; cannot add a constant. 
5. Yes. 
1 2 
7. ([T]=]3 -5 
0 2 
1 0O 3 
9. ([T]=]1 0 -1 
1 0O 1 
11. [T]=[1 2 3 4] 
Section 5.3 
3 -1 
1. ¥+y=]2),K-y= | -4 
4 0 
Sketches will vary slightly depending on 
orientation. 
Yeo 
4 2 
3. X+¥=]4),%-y= 2 
8 —4 


Sketches will vary slightly depending on 
orientation. 
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Chapter A Solutions To Selected Problems 


2x 
¥-¥ 
K+Y 
Zz 
vy 
y 
x 
7. Sketches may vary slightly. 
Zz 
—>y 
K+ “ 
x 
¥-7 
2x 


9. ||x|| = 30, ||ax|] = 120 = 2/30 
11. ||X|| = /54 = 36, 
||ax|| = 270 = 15V6 


5. Sketches may vary slightly. 
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Index 


antisymmetric, 128 inconsistent, 24 
augmented matrix, 8 inverse 
computing, 107 
basic variable, 27 definition, 106 
Invertible Matrix Theorem, 113 
characteristic polynomial, 168 properties, 113, 119 
cofactor, 137 uniqueness, 106 
expansion, 139, 146 Invertible Matrix Theorem, 113, 155, 183 
consistent, 24, 88, 90 
Cramer’s Rule, 159 leading one, 14, 27, 29 
linear equation, 3 
determinant linear transformation 
3 x 3 shortcut, 155 and 0, 208 
and elementary row operations, 150 conditions on, 213 
definition, 141 definition, 204 
of 2 x 2 matrices, 136 standard matrix of, 207, 209 


of triangular matrices, 148 


properties, 154 matrix 
diagonal, 118 addition, 47 
definition, 123 arithmetic properties, 49 
augmented, 8 
eigenvalue cofactor, 137 
definition, 164 definition, 7 
finding, 168 determinant, 136, 141 
properties, 182 diagonal, 123 
eigenvector, see eigenvalue equality, 46 
elementary row operations, 13 identity matrix, 61 
and determinants, 150 inverse, 106, 107 
minor, 137 
free variable, 25, 27 multiplication, 54 
properties, 62 
Gaussian elimination, 15, 18 scalar multiplication, 47 
backward steps, 17 the zero matrix, 49 
forward steps, 17 transpose, 122 
triangular, 123 
homogeneous, 87, 88, 90 minor, 137 
identity matrix, 61 Parallelogram Law, 69 
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Index 


particular solution, 30 column, 52 
problem solving, 37 length, 74, 224 
pseudoinverse, 130 row, 52 
reduced echelon form, 14 zero matrix, 49 


reduced row echelon form, 14 
row echelon form, 14 


skew symmetric, 128 
definition, 128 
theorem, 129 

solution, 3 
infinite, 24, 27, 90 
none, 24 
particular, 30 
types, 24 
unique, 24, 90, 110 

standard unit vector, 208 

symmetric, 128 
definition, 128 
theorem, 129 

system of linear equations 
consistent, 24, 27, 88, 90 
definition, 3 
homogeneous, 87 
inconsistent, 24, 29 
solution, 3 


trace 
definition, 131 
properties, 133 

transpose, 121 
definition, 122 
properties, 126 
skew-symmetric, 128 
symmetric, 128 

triangular matrix 
definition, 123 
determinant, 148 


variable 
basic, 27 
dependent, 27 
free, 25, 27 
independent, 27 
vector 
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